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POLYMORPHISM DETECTION AND SEPARATION 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of international application 
PCT/US02/09691, filed March 27, 2002, and a continuation-in-part of U.S. provisional 
5 application serial no. 60/325,828, filed September 28, 2001 , the disclosures of which are 
incorporated herein by reference in their entireties. 

BACKGROUND OF THE INVENTION 

A wide variety of methods have recently been developed to detect and 

1 o characterize single nucleotide polymorphisms (SNPs) within complex genomes. 

Each of these methods strives to satisfy a number of oft-competing needs: 
the need to identify the target locus uniquely within a complex sample (target specificity) 
with the need to distinguish among the polymorphic variants of the specified target (allelic 
selectivity); the need to provide specificity and selectivity at a single target with the need to 
1 5 query a large number of targets concurrently in a single multiplexed reaction; the need to 
provide a detectable signal from a small genomic or transcriptional sample with the need to 
avoid target amplification approaches that introduce spurious mutations; the need to query a 
duplex sample with reactions - such as hybridization, primer extension, and template-driven 
ligation - that require single-stranded substrates. 

2 0 Current approaches to SNP detection and characterization address these 

problems using a variety of reaction schemes, with varying degrees of success. Reviewed 
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in Kirk ef a/., NucL Acids Res. 30:3295-3311 (2002); Syvanen, Nature Reviews, Genetics 
2:930-942 (2001); Kwok, Annu. Rev. Genomics Hum. Genet 2:235-258 (2001). 

Recently, it has become possible to target single nucleotide changes 
directly into long pieces of genomic DNA, including YACs, BACs, and even intact cellular 
5 chromosomes using sequence-altering oligonucleotides. See WO 01/73002; WO 01/92512; 
and WO 02/10364. Although the frequency of targeted change is appreciable, it is still less 
than 100%, and there is thus a need for methods that permit the engineered variants to be 
separated from nucleic acids that differ by as few as one nucleotide therefrom. Yet none of 
the common techniques for SNP detection readily permits the direct purification or isolation 
10 of the variants so detected, particularly in their full length, unamplified, double-stranded 
form. 

Methods for isolating DNA analytes in double-stranded form are known. 
The methods use RecA protein, a critical component in the cellular process of homologous 
recombination. 

15 In the presence of ATP, RecA protein polymerizes on single-stranded DNA 

to form a right-handed helical nucleoprotein filament that is capable of invading duplex DNA 
in a sequence-independent fashion. Thereafter, in a process as yet incompletely 
understood, the RecA coated nucleoprotein filament searches along the duplex for 
homologous sequence. Once homologous sequence is located, RecA mediates further 

2 o • ATP-dependent reactions necessary to effect strand exchange and recombination. In the 
presence of a non-hydrolyzable ATP analogue, such as ATP-yS, the process halts after 
regions of homology have been found, with the nucleoprotein filament bound to the duplex 
in a joint structure commonly termed a displacement loop ("D-loop"). The strand of the 
duplex displaced by the RecA filament is available to bind a second single-stranded nucleic 

2 5 acid; the four-stranded joint structure so formed is termed a double D-loop ("double D-loop", 

"dDloop"). 

U.S. Patent No. 5,670,316 describes methods of isolating DNA duplexes by 
using £ coliRecA protein to anchor complementary polynucleotide probes within a double 
displacement loop formed at a desired target sequence; at least one of the probes has a 

3 o moiety that permits the subsequent specific capture and isolation of the double D-loop- 

containing duplex. 
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The RecA filament homology search is permissive of mismatches, 
however; the in vivo manifestation of this phenomenon is the well known tolerance of 
homologous recombination for the presence of heterologous sequence. 

Accordingly, the methods of isolation described in U.S. Patent No. 
5,670,316 are said to tolerate up to 30% mismatch between probes and target duplex. 
Analogous dDloop isolation methods described in WO 02/10457 are said to tolerate 60% 
mismatch as between probe and target, although no more than 30% sequence mismatch is 
said to be preferred. 

Although RecA-mediated purification procedures permit separation of 
double-stranded targets, tolerance of mismatches between probe and target precludes the 
separation and isolation of polymorphic variants that differ by a single nucleotide. 

Accordingly, there remains a need in the artforgenotyping methods that 
permit single nucleotide polymorphisms readily to be detected within a complex genome, 
and that permit specific polymorphic variants so detected to be isolated in double-stranded 
form from variants that may differ by as few as one nucleotide therefrom. 



20 
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SUMMARY OF THE INVENTION 

The present invention solves these and other needs in the art by providing 
methods, compositions, and kits for distinguishing the presence of one or more double- 
stranded nucleic acid targets present within a sample of nucleic acids from variants that can 
differ by as few as one nucleotide therefrom, and methods that optionally permit such 
targets selectively to be separated and purified from such samples. 

The methods are readily multiplexed, permitting a large number of loci to 
be screened within a single sample, may be adapted to a variety of existing detection 
systems, and permit target amplification without PCR, increasing fidelity. The ability to 
separate desired double stranded targets with allelic selectivity, with or without 
contemporaneous detection, offers significant advantages over current genotyping methods. 

The invention is based in part upon the development of reaction conditions 
under which the paradigmatic mismatch tolerance of RecA-mediated D-loop formation does 
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not act to prohibit single nucleotide mismatch discrimination, Preferred reaction conditions 
can differ depending upon the topological state of the target. 

In a first aspect, therefore, the invention provides a method for 
distinguishing the presence of a nonsupercoiled target nucleic acid from the presence of 
5 nonsupercoiled target variants within a sample of nucleic acids, the variants differing from 
the target by as few as one nucleotide within a common target query region. 

The method comprises using a recombinase to mediate formation of at 
least one deproteinization-stable double D-loop in the query region of the target, under 
conditions that favor double D-loop formation at the target query region over formation at 

1 o variants that differ from the target by as few as one nucleotide therefrom, and then 

distinguishing the degree of formation of double D loops that are stable to deproteinization. 
A greater degree of formation distinguishes the presence of target from that of variants. 

In typical embodiments, the double D-loop formation conditions comprise 
contacting the nucleic acid sample with a first oligonucleotide and a second oligonucleotide 
15 (hereinafter also denominated "incoming" and "annealing" oligonucleotides, respectively). 
The first oligonucleotide may be contacted to sample before the second oligonucleotide or 
contemporaneously therewith. 

The first oligonucleotide includes a complementarity region that is perfectly 
complementary in sequence to a first strand of the target across the entirety of the target 

2 o query region. The second oligonucleotide includes a complementarity region that is 

perfectly complementary in sequence to a second strand of the target across at least a 
portion of the target query region. And at least one of the first or second oligonucleotide 
complementarity regions is imperfectly complementary to the respective first or second 
strands of the query region of each of the target variants desired selectively to be 

2 5 discriminated from target. 

The first oligonucleotide is bound by a recombinase. The second 
oligonucleotide comprises base modifications and does not substantially bind the 
recombinase. And at least one of the first and second oligonucleotides is distinguishably 
detectable. 

3 0 In typical embodiments, the first oligonucleotide is no more than about 100 

nucleotides in total length, typically no more than 50 nucleotides in length; the first 
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oligonucleotide complementarity region is thus usually no more than 100 nucleotides in 
length, typically no more than 50 nucleotides in length, and can be smaller. 

The second oligonucleotide base modifications can be selected from the 
group consisting of LNA bases, PNA bases, RNA bases, and Z-OMe bases. Typically, the 
5 second oligonucleotide includes at least 30% modified bases, often at least 50% modified 
bases, and be include at least 75% modified bases or more. 

The second oligonucleotide, in certain embodiments, is no more than 50 nt 
in total length, typically no more than 25 nt in total length, and at times as short as 20 nt, 
and even 15 - 16 nt in total length. The second second oligonucleotide complementarity 

1 o region is frequently no more than 50 nucleotides in length, 25 nt in length, and in certain 

embodiments is as short as 15 - 16 nt in length. 

Typically, the first and second oligonucleotide complementarity regions 
overlap by no more than 25 nt, often by no more than 15 nt. 

Usefully, at least one of the first and second oligonucleotides includes at 
1 5 least one detectabe label. In certain embodiments, the label is selected from the group 
consisting of a radionuclide, a fluorophore, a fluorescence resonance energy transfer 
tandem fluorophore, a fluorescence resonance energy transfer donor, a fluorescence 
resonance energy transfer acceptor, a mass tag, an enzyme, a genotypic label ("bar code 
tag"), or a hapten, with fluorophores and genotypic barcode tags having particular utility. 

2 0 The contacting step may be performed, in certain embodiments, at a 

temperature of at least about 37°C, 45°C, 50°C, and even at least about 55°C. 

The methods of the present invention may further comprise the step, after 
forming and before distinguishing the degree of formation of the double D loops, of 
deproteinizing the nucleic acids of said sample. 

2 5 Deproteinization may be performed, for example, at a temperature of at 

least 37°C, and for no more than about 10 minutes. 

The double D loops may usefully be stable for a time following 
deproteinization sufficient to permit detectable separation of target from target variants. 
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For example, in certain embodiments the double D loops are stable for at 
least 2 hours at 4°C following deproteinization, at least 4 hours at 4°C following 
deproteinization, and even for at least 30 minutes at 37°C following deproteinization. 

The nonsupercoiled double-stranded target in the methods of this aspect of 
5 the present invention may be linear duplex DNA, a covalently closed circle, or a nicked 
circle, among others. 

The target may be within a nucleic acid preparation lacking vector 
sequences or, in the alternative or in addition, be within a nucleic acid preparation having 
vector sequences. 

1 o The target may, for example, be within an artificial chromosome, and the 

target query region may usefully be flanked in such artificial chromosome by recognition 
sites for a site-specific recombinase. 

The nucleic acids of the sample to be queried may be pooled from a 
plurality of individuals or drawn from a single individual. 

is In a series of embodiments, the nucleic acid sample includes at least one 

variant that differs from the target by no more than one nucleotide in the query region. The 
variants that are concurrently present may be naturally-occurring allelic variants of the 
target, somatically mutated variants of the target, or recombinantly-engineered variants of 
the target. 

2 o The method may further comprise the step, after deproteinizing and before 

distinguishing the double D loops, of separating the nucleic acids that have double D loops 
from nucleic acids lacking double D loops. Usefully, at least one of the first and second 
oligonucleotides is tethered to a solid support or includes a capture moiety, such as biotin. 

The method may further comprise the step, after double D loop formation, of 
2 5 extending by polymerase either or both of the first or second oligonucleotides, either by a 
single base, or by multiple bases, the latter permitting allele-specific amplification of the 
target query region. Amplification may be isothermal or thermal cycling. 

The method may further comprise the step, after double D loop formation, of 
selectively cleaving either the target or variants thereof. 
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The method may further comprising the step of quantifying the absolute or 
relative abundance of target. 

In a second aspect, the invention provides a method of distinguishing the 
presence of a supercoiled target nucleic acid from the presence of supercoiled target 
variants within a sample of nucleic acids, the variants differing from the target by as few as 
one nucleotide within a common target query region. 

The method comprises using a recombinase to mediate formation of at 
least one deproteinization-stable single D-loop or double D-loop in the query region of the 
target, under conditions that favor formation at the target query region over formation at 
variants that differ from the target by as few as one nucleotide therefrom, and then 
distinguishing the degree of formation of single D-loops or double-D loops that are stable to 
deproteinization, a greater degree of formation distinguishing the presence of target from 
that of variants. 

Formation conditions in this aspect of the invention typically comprise 
contacting the sample with at least a first oligonucleotide to form at least a single D-loop at 
the target query region. 

The first oligonucleotide is bound by a recombinase, and includes a 
complementarity region that is (i) perfectly complementary in sequence to a first strand of 
the target across the entirety of the target query region and (ii) imperfectly complementary 
to a first strand of the query region of each of the target variants desired to be distinguished . 

The method may further comprise contacting the sample with a second, 
"annealing", oligonucleotide to effect formation of a double D-loop at the target query 
region. The second oligonucleotide comprises base modifications, does not substantially 
bind recombinase, and includes a complementarity region that is perfectly complementary in 
sequence to a second strand of the target across at least a portion of the target query 



In these latter, two oligonucleotide, double D-loop embodiments, at least 
one of the first and second oligonucleotide complementarity regions is imperfectly 
complementary to a respective strand of the query region of each of the target variants, and 
the second oligonucleotide is typically distinguishably detectable. 
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In certain embodiments, the method may further comprise contacting the 
sample with a third oligonucleotide. The third oligonucleotide comprises base modifications 
and does not substantially bind the recombinase. It includes a complementarity region that 
is perfectly complementary in sequence to at least a portion of the second strand of the 
5 query region of a target variant as to which the target is desired to be discriminated, and 
that is imperfectly complementary in sequence to the complementarity region of said first 
oligonucleotide. 

In another aspect, the invention provides methods of distinguishabiy 
detecting the presence of a plurality of nonsupercoiled targets within a sample of nucleic 
l o acids with selectivity sufficient to distinguish each of the plurality of targets from variants that 
respectively differ by as few as one nucleotide therefrom at a query region that is common 
therebetween. 

The method comprises using a recombinase to mediate formation, 
separately for each of the plurality of targets desired to be detected, of at least one 
1 5 deproteinization-stable double D loop in the target's query region, under conditions that 

favor double D loop formation at the target query region over formation at variants that differ 
from the target by as few as one nucleotide therefrom, each target's double D-loop being 
distinguishabiy detectable from all others of the double D-loops formed in said sample; and 
then distinguishabiy detecting each of the stable double-D loops so formed. 

20 In certain embodiments, the formation conditions comprise contacting the 

sample, for each of the plurality of targets desired to be detected, with a first oligonucleotide 
and a second oligonucleotide: the first oligonucleotide includes a complementarity region 
that is perfectly complementary in sequence to a first strand of its respective target across 
the entirety of the target query region; the second oligonucleotide includes a 

25 complementarity region that is perfectly complementary in sequence to a second strand of 
the same target across at least a portion of the target query region; and either or both of the 
oligonucleotide regions is imperfectly complementary in sequence to respective first and 
second strands of the query region of each of the other targets desired discriminably to be 
detected. 



30 



In these embodiments, the first oligonucleotide is bound by a recombinase 
and the second oligonucleotide comprises base modifications and does not substantially 
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bind said recombinase, and at least one of the oligonucleotides is distinguishable from the 
first and second oligonucleotides used to detect each of the others of the plurality of targets 
desired to be detected. 

In another aspect, the invention provides a method of distinguishably 
detecting the presence of a plurality of supercoited targets within a sample of nucleic acids 
with selectivity sufficient to distinguish each of the plurality of targets from variants that 
respectively differ by as few as one nucleotide therefrom at a query region that is common 
therebetween. 

The method comprises using a recombinase to mediate formation 
separately for each of the plurality of targets desired to be detected, of at least one 
deproteinization-stable single- or double-D loop in the target's query region under 
conditions that favor single- or double-D loop formation at the target query region over 
formation at variants that differ from the target by as few as one nucleotide therefrom each 
targets s.ngle- or double D-loop being distinguishably detectable from all others of the D- 
ioops formed in the sample; and then distinguishably detecting each of the stable double-D 
loops so formed. 

The formation conditions may usefully comprise contacting the sample for 
each of the plurality of targets desired to be detected, with a first oligonucleotide, wherein 
the first oligonucleotide includes a complementarity region that is perfectly complementary 
m sequence to a first strand of its respective target across the entirety of the target query 
region and imperfectly complementary in sequence to a first strand of the query region of 
each of the other targets desired discriminably to be detected, wherein the first 
oligonucleotide is bound by a recombinase and is distinguishable from the first 
oligonucleotide used to detect each of the others of the plurality of targets desired to be 
detected. 

In other embodiments, the method further comprises contacting the 
sample, for each of the plurality of targets desired to be detected, with a second 
olrgonucleotide, wherein the second oligonucleotide includes a complementarity region that 
■ Perfectly complementary in sequence to a second strand of the target across at least a 
portion of the target query region, wherein at least one of the first and second 
oUgonucleotide complementarity regions is imperfectly complementary to the respective 
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strand of the query region of each of the target variants, wherein the second oligonucleotide 
comprises base modifications and does not substantially bind said recombinase, and 
wherein the second oligonucleotide is distinguishably detectable. 

Usefully, the methods of distinguishably detecting the presence of a 
5 plurality of supercoiled targets within a sample of nucleic acids, with selectivity sufficient to 
distinguish each of the plurality of targets from variants that respectively differ by as few as 
one nucleotide therefrom at a query region that is common therebetween, may further 
comprise quantifying the relative abundance of each of said targets. 

In certain embodiments, at least 10, 50, and as many as 100-10,000 

1 o targets are discriminably detected, either seriatim or concurrently. Usefully, concurrent 

detection is by microarray hybridization. 

In another aspect, the invention provides methods of separating a double- 
stranded nucleic acid target from other nonsupercoiled nucleic acids present within a 
sample of nucleic acids, with selectivity sufficient to separate the target from variants that 
1 5 differ from the target by as few as one nucleotide within a common target query region. 

The method comprises using a recombinase to mediate formation of at 
least one deproteinization-stable double D loop in the query region of the target, under 
conditions that favor double D loop formation at the target query region over formation at 
variants that differ from the target by as few as one nucleotide thereof; and then separating 

2 o nucleic acids having deproteinization-stable double D loops from other nucleic acids present 

within the sample. 

Typical formation conditions comprise contacting the sample with a first 
oligonucleotide and a second oligonucleotide, wherein the first oligonucleotide includes a 
complementarity region that is perfectly complementary in sequence to a first strand of the 

2 5 target across the entirety of the target query region, the second oligonucleotide includes a 

complementarity region that is perfectly complementary in sequence to a second strand of 
the target across at least a portion of the target query region, and at least one of the first or 
second oligonucleotide complementarity regions is imperfectly complementary to the 
respective first or second strands of the query region of each of said target variants. The 

3 o first oligonucleotide is bound by a recombinase and the second oligonucleotide comprises 

base modifications and does not substantially bind said recombinase. 
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In some embodiments, at least one of the first and second oligonucleotides 
includes a capture moiety and the nucleic acids having deproteinization-stable double D 
loops are separated from other nucleic acids present within the sample by capture of the 
moiety. In certain of these embodiments, the moiety is captured to a solid substrate, such 
as a magnetic bead. 

The method can effect purification of the target, with single nucleotide 
selectivity, of at least 10-fold, and even at least 100-fold, 1000-fold, to 10,000-fold to 106- 
fold or more. 

In another aspect, the invention provides nucleic acid compositions 
characterized by the presence of at least one deproteinized double D loop at a query region 
within a nucleic acid target. The deproteinized double D-loop includes a first and a second 
oligonucleotide, wherein the first oligonucleotide includes a complementarity region that is 
perfectly complementary in sequence to a first strand of the target across the entirety of the 
target query region, wherein the second oligonucleotide includes a complementarity region 
that is perfectly complementary in sequence to a second strand of the target across at least 
a portion of the target query region, wherein the first oligonucleotide is bound by a 
recombinase, wherein said second oligonucleotide comprises base modifications and does 
not substantially bind said recombinase, and wherein at least one of the first and second 
oligonucleotides is distinguishably detectable. 

In some embodiments, the composition further comprises at least one 
variant that differs from the target by as few as one nucleotide within a query region that is 
common therebetween, wherein the query region of each of the at least one target variants 
lacks a double D loop. 

In another aspect, the invention provides nucleic acid compositions 
characterized by the presence of a plurality of deproteinized double D-loops, each of the 
plurality formed at a query region within a respective nucleic acid target, wherein each 
double D-loop includes a first and a second oligonucleotide, wherein the first oligonucleotide 
includes a complementarity region that is perfectly complementary in sequence to a first 
strand of the target across the entirety of the target query region, wherein the second 
oligonucleotide includes a complementarity region that is perfectly complementary in 
sequence to a second strand of the target across at least a portion of the target query 
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region, wherein the first oligonucleotide is bound by a recombinase, and wherein the second 
oligonucleotide comprises base modifications and does not substantially bind said 
recombinase. 

At least two of the plurality of targets may differ in sequence by 1 - 1 0 
5 nucleotides as between their respective query regions, and in some embodiments, at least 
two of said plurality of targets differ in sequence by exactly 1 nucleotide as between their 
respective query regions. 

The composition may include at least 10, 100, 1000, even 10,000 targets, 
either supercoiled or nonsupercoiled. 

io In another aspect, the invention provides a kit for distinguishing the 

presence of a target nucleic acid from the presence of target variants within a sample of 
nucleic acids, the variants differing from the target by as few as one nucleotide within a 
common target query region. 

The kit comprises a first oligonucleotide, a second oligonucleotide, and a 

1 5 RecA-like recombinase. 

The first oligonucleotide includes a complementarity region that is perfectly 
complementary in sequence to a first strand of the target across the entirety of the target 
query region. The second oligonucleotide includes a complementarity region that is 
perfectly complementary in sequence to a second strand of the target across at least a 

2 o portion of the target query region. And at least one of the first or second oligonucleotide 
complementarity regions is imperfectly complementary to the respective first or second 
strands of the query region of each of the target variants. 

The first oligonucleotide is capable of being bound by a recombinase and 
the second oligonucleotide comprises base modifications and is incapable of substantially 
2 5 binding said recombinase. At least one of the first and second oligonucleotides is 
distinguishably detectable. 

In some embodiments, the first oligonucleotide and RecA are combined in 
a single composition. In typical embodiments, at least one of the first and second 
oligonucleotides is detectably labeled. 
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In another aspect, the invention provides a kit for separately distinguishing 
the presence of a plurality of targets within a nucleic acid sample, with selectivity sufficient 
to distinguish each of the plurality of targets from variants that respectively differ by as few 
as one nucleotide therefrom at a query region that is common therebetween. 

The kit comprises a RecA-like recombinase, and, for each target desired to 
be separately to be distinguished, a pair of first and second oligonucleotides. 

The first oligonucleotide of the pair includes a complementarity region that 
is perfectly complementary in sequence to a first strand of its respective target across the 
entirety of the target query region. The second oligonucleotide of each pair includes a 
complementarity region that is perfectly complementary in sequence to a second strand of 
the same target across at least a portion of the target query region. Either or both of the 
oligonucleotide complementarity regions is imperfectly complementary in sequence to 
respective first and second strands of the query region of each of the other targets desired 
discriminably to be detected. 

The first oligonucleotide is capable of binding a recA-like recombinase and 
the second oligonucleotide comprises base modifications and is incapable of substantially 
binding a RecA-like recombinase, and at least one of The oligonucleotides is distinguishable 
from the first and second oligonucleotides used to detect each of the others of the plurality 
of targets desired to be detected. 

In some embodiments, each of said first oligonucleotides is combined with 
RecA, either in a separate composition, or in a single, common, composition. 

Typically, at least one of the first and second oligonucleotides of each 
target pair is detectably labeled, the label of each target pair being distinguishable from that 
of each other target pair. Usefully, the label may be a fluorescent label or a genotypic label. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The above and other objects and advantages of the present invention will 
be apparent upon consideration of the following detailed description, taken in conjunction 
with the accompanying drawings, in which like reference characters refer to like parts 
throughout, and in which: 
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FIG. 1 is a flow diagram for the generation of double D-loops according to 
embodiments of the methods of the present invention; 

FIG. 2 is an autoradiogram showing double D-loop formation on a 
radiolabeled linear target under various conditions, according to embodiments of the 
5 present invention; 

FIG. 3 is an autoradiogram showing double D-loop formation on a 
radiolabeled linear target under various conditions, according to embodiments of the 
present invention; 

FIG. 4 is an autoradiogram showing double D-loop formation on a 

1 o radiolabeled linear targetrunder various conditions, according to embodiments of the 

present invention; 

FIG. 5 is an autoradiogram showing temperature effects on double D-loop 
formation, according to embodiments of the present invention; 

FIG. 6 shows Y-loop (Y-arm) formation over a range of temperatures, 
1 5 according to embodiments of the present invention; 

FIG. 7 shows double D-loop formation over a range of times, in minutes, 
according to embodiments of the present invention; 

FIG. 8 shows Y-anm formation over a range of times, in minutes, according 
to embodiments of the present invention; 

' 2 o FIG. 9 shows Y-arm formation with varying incoming and annealing 

oligonucleotides, according to embodiments of the present invention; 

FIG. 10 shows Y-arm formation with varying incoming and annealing 
oligonucleotides, as indicated, according to embodiments of the present invention; 

FIG. 11 shows double D-loop formation with annealing oligonucleotides as 

2 5 indicated, according to embodiments of the present invention; 

FIG. 12 shows double D-loop formation using annealing oligonucleotides 
as indicated, according to embodiments of the present invention; 



WO 03/027640 



15 



PCT/US02/31073 



S 



FIG. 13 shows the oligonucleotide sequence [SEQ ID NO: 37] of the Kan- 
PCR product used as a target for double D-loop formation in embodiments of the present 
invention; 

FIG. 14 shows the efficiency of double D-loop formation using the Kan- 
PCR product as the target, with efficiency normalized to double D-loop formation efficiency 
with KL02 (indicated in the figure as LNA-15mer), according to embodiments of the present 
invention; 

FIG. 15. shows the oligonucleotide sequence of the Hyg- PCR product 
[SEQ ID NO: 1 15] used as a target for double D-loop formation and as non-specific 
competitor DNA in Example 12; 

FIG. 16 shows the efficiency of double D-loop formation according to 
embodiments of the present invention, using the Hyg- PCR product as the target, with 
efficiency normalized to double D-loop formation efficiency with Hyg15T; 

FIG, 17 shows double D-loop formation according to embodiments of the 
present invention in yeast genomic DNA from a strain with an integrated copy of a Hyg- 
target and from a strain with an integrated copy of a Hyg + target; 

FIGS. 18A and 1 8B are two color fluorescence scans of electrophoresis 
gels showing single nucleotide mismatch discrimination in an 8 kb plasmid target using PNA 
annealing oligonucleotides according to embodiments of the present invention; 

FIG. 19 is a two color fluorescence scan of an electrophoresis gel showing 
single nucleotide mismatch discrimination in an 8 kb plasmid target using LNA annealing 
oligonucleotides according to embodiments of the present invention; 

FIG. 20 is a two color fluorescence scan of an electrophoresis gel showing 
single nucleotide mismatch discrimination in an 8 kb plasmid target using LNA annealing 
oligonucleotides according to embodiments of the present invention, with temperatures as 
indicated; 

FIG. 21 is a two color fluorescence scan showing single nucleotide 
mismatch discrimination in a linear target without the presence of competing 
oligonucleotides, according to embodiments of the present invention; 
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FIG. 22 is a single color fluorescence scan demonstrating that either the 
incoming or annealing oligonucleotides may be labeled in the methods of the present 
invention; 

FIG. 23 is a single color fluorescence scan showing that minimal formation 
5 of double D-loops (indicated by arrows) is seen with incubations below about 37°C, with 
optimal formation observed from 50 to about 55°C using a 45-mer incoming and 15-mer 
LNA-containing annealing oligonucleotide according to embodiments of the present 
ivnention; 

FIG. 24 is a two color fluorescence scan showing mismatch discrimination 

1 o using competing and distinguishably labeled annealing oligonucleotides, according to 

embodiments of the present invention; 

FIG. 25 is a two color fluorescence scan showing the size-dependence of 
mismatch discrimination according to embodiments of the present invention; 

FIG. 26 is a single color fluorescence scan demonstrating detection of 
1 5 double D-loops down to 1 .0 ng of an 8 kb linear target DNA without signal amplification; 

FIGS. 27A and 27B show detection of human beta tubulin PCR product 
using double D-loop formation; 

FIG. 28 is a table summarizing data demonstrating gene-specific 
separation of plasmid clones; 

2 o FIG. 29 is a table summarizing data demonstrating variant-selective DNA 

separation; and 

FIG. 30 is a two color fluorescence scan of double D-loops formed with 
oligonucleotides labeled as shown, 

25 DETAILED DESCRIPTION 

We have developed reaction conditions under which the paradigmatic 
mismatch tolerance of RecA-mediated D-loop formation does not act to prohibit single 
nucleotide mismatch discrimination. Using these conditions, we have been able for the first 
time to develop D-loop and double D-loop based methods for detecting and separating 
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double-stranded nucleic acid targets that differ from polymorphic variants by as few as a 
single nucleotide. The methods are readily multiplexed, permitting a large number of loci to 
be screened within a single sample, may be adapted to a variety of existing detection 
systems, and permit target amplification without PCR, increasing fidelity. The ability to 
5 separate selected double stranded variants, with or without contemporaneous detection, 
offers significant advantages over current genotyping methods, and finds particular utility in 
facilitating the construction of coisogenic cell collections in which the cells differ 
genotypically by single nucleotide changes targeted to defined loci. 

Preferred reaction conditions can differ depending upon the topological 
10 state of the target. 

In a first aspect, therefore, the invention provides a method for 
distinguishing the presence of a nonsupercoiled target nucleic acid from the presence of 
nonsupercoiled target variants within a sample of nucleic acids, the variants differing from 
the target by as few as one nucleotide within a common target query region. 

1 5 The method comprises using a recombinase to mediate formation of at 

least one deproteinization-stable double D-loop in the query region of the target, under 
conditions that favor double D-loop formation at the target query region over formation at 
variants that differ from the target by as few as one nucleotide therefrom, and then 
distinguishing the degree of formation of double D loops that are stable to deproteinization. 

20 A greater degree of formation distinguishes the presence of target from that of variants. 

In typical embodiments, the double D-loop formation conditions comprise 
contacting the nucleic acid sample with a first oligonucleotide and a second oligonucleotide 
(hereinafter also denominated "incoming" and "annealing" oligonucleotides, respectively). 
The first oligonucleotide may be contacted to sample before the second oligonucleotide or 

2 5 contemporaneously therewith . 

The first oligonucleotide includes a complementarity region that is perfectly 
complementary in sequence to a first strand of the target across the entirety of the target 
query region. The second oligonucleotide includes a complementarity region that is 
perfectly complementary in sequence to a second strand of the target across at least a 

3 o portion of the target query region. And at least one of the first or second oligonucleotide 

complementarity regions is imperfectly complementary to the respective first or second 
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strands of the query region of each of the target variants desired selectively to be 
discriminated from target. 

The first oligonucleotide is bound by a recombinase. The second 
oligonucleotide comprises base modifications and does not substantially bind the 
5 recombinase. And at least one of the first and second oligonucleotides is distinguishably 
detectable. 

In a second aspect, the invention provides a method of distinguishing the 
presence of a supercoiled target nucleic acid from the presence of supercoiled target 
variants within a sample of nucleic acids, the variants differing from the target by as few as 

1 o one nucleotide within a common target query region. 

The method comprises using a recombinase to mediate formation of at 
least one deproteinization-stable single D-loop or double D-loop in the query region of the 
target, under conditions that favor formation at the target query region over formation at 
variants that differ from the target by as few as one nucleotide therefrom, and then 
1 5 distinguishing the degree of formation of single D-toops or double-D loops that are stable to 
deproteinization, a greater degree of formation distinguishing the presence of target from 
that of variants. 

Formation conditions in this aspect of the invention typically comprise 
contacting the sample with at least a first oligonucleotide to form at least a single D-loop at 
20 the target query region. 

The first oligonucleotide is bound by a recombinase, and includes a 
complementarity region that is (i) perfectly complementary in sequence to a first strand of 
the target across the entirety of the target query region and (ii) imperfectly complementary 
to a first strand of the query region of each of the target variants desired to be distinguished. 

2 5 The method may further comprise contacting the sample with a second, 

"annealing", oligonucleotide to effect formation of a double D-loop at the target query 
region. The second oligonucleotide comprises base modifications, does not substantially 
bind recombinase, and includes a complementarity region that is perfectly complementary in 
sequence to a second strand of the target across at least a portion of the target query 
30 region. 
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In these latter, two oligonucleotide, double D-loop embodiments, at least 
one of the first and second oligonucleotide complementarity regions is imperfectly 
complementary to a respective strand of the query region of each of the target variants, and 
the second oligonucleotide is typically distinguishably detectable. 

In the methods of the present invention, whether for distinguishing 
nonsupercoiled or supercoiled targets from variants thereof, the recombinase is a RecA-like 
recombinase. 

The best known RecA-like recombinase is the RecA protein from £ coli, 
which is available commercially (Roche Applied Science, Indianapolis, IN USA). 

RecA orthologues have also been identified in a wide variety of prokaryotic 
genera, including Bacillus (B. halodurans, GenBank accession no. NP_243249), 
Streptomyces (S. agalactiae, GenBank accession no. NP_689079; S. pyogenes MGAS315, 
GenBank accession no. NP_665604); Staphylococcus (S. aureus subsp. Aureus MW2, 
GenBank accession no. NP_645985); Brucella (B. melitensis, GenBank accession no. 
NP_539704); Helicobacter (H. pylori, GenBank accession no. NP_206952); 
Corynebacterium (C. glutamicum, GenBank accession no. NPJ01162); Bordetella (B. 
hinzii, GenBank accession no. MM92267); Bacteroides (B. fragilis, GenBank accession no. 
AAK58827); Haemophilus (H. influenzae, GenBank accession no. AAM91954); 
archaebacteria (Reich etai, Extremophiles 5(4):265-75 (2001); and others. 

In eukaryotes, RecA orthologues are typically members of the Rad51 
family, Shibata etal, Proc. Natl. Acad. Sci. USA 98:8425-8432 (2001), members of which 
have been identified in a wide variety of eukaryotic organisms including yeasts, such as 
Saccharomyces cerevisiae (GenBank accession no. NP_011021), Drosophila melanogaster 
(GenBank accession no. Q27297), Caenorhabditis elegans (GenBank accession no. 
BAA24982), and homo sap/ensfYoshimura ef a/., "Cloning and sequence of the human 
RecA-like gene cDNA," Nucl. Acids Res. 21(7):1665 (1993); GenBank accession no. 
NP_002866). 

Allelic variants and engineered mutants (collectively, muteins) of RecA and 
of its orthologues have also been described, including recA-803, Madiraju etai, Proc. Natl. 
Acad. Sci. USA 85(18):6592-6 (1988). 
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For purposes of the present invention, the phrases "recombinase" and 
"RecA-like recombinase" include all such RecA orthologues and variants that are capable of 
forming nucleoprotein filaments with single-stranded DNA and thereafter mediating 
sequence-specific D-loop formation. Functional tests for the sufficiency of any such 
5 candidate recombinase for use in the methods of the present invention may conveniently 
include substitution of the candidate protein for £ coli RecA in the assays further described 
in the Examples below. 

The first, or "incoming" oligonucleotide, is bound by the recombinase (so 
bound, the oligonucleotide is typically referred to as a nucleoprotein filament or RecA 

1 o filament). Accordingly, the incoming oligonucleotide is composed of nucleobases that do 

not preclude RecA binding, typically natural deoxyribonucleosides in typical phosphodiester 
linkage. 

In some embodiments, the methods of the present invention include the 
antecedent step of binding RecA to the incoming oligonucleotide to form a nucleoprotein 
15 filament. 

Nucleoprotein filament formation is typically performed in the presence of at 
least one co-factor, such as ATPyS, GTPyS, a mixture of ATPyS and rATP, or rATP alone 
in the presence of a rATP regenerating system. In a particularly useful embodiment, the 
RecA protein coating reactions of the methods of the present invention are carried out using 

2 0 ATPyS. 

Conditions for creating RecA filaments are well established for £ coli 
RecA; representative conditions are set forth in the Examples herein below. Conditions for 
RecA mutants and RecA orthologues may be determined by routine experimentation. In 
one approach, the mutant or orthologue may be substituted for RecA in the strand 

2 5 exchange activity assay described in Cox et al., Proc. Natl. Acad. Sci. USA 78:3433 (1981), 

the disclosure of which is incorporated herein by reference in its entirety, and reaction 
conditions routinely altered to determine optimal conditions. In another approach, the 
mutant or orthologue may be substituted for RecA in any of the assays set forth in the 
Examples below, and reaction conditions routinely altered to determine optimal conditions. 

3 o The first oligonucleotide is typically single-stranded, and is typically at least 

about 25, 26, 27, 28, 29, or 30 nucleotides in length, may be about 35, 40, 45, or 50 
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15 



20 



25 



30 



nucleotides in length, and may even be longer, including 75, 100, even 200, 300, 400, or 
500 nt in length. The first oligonucleotide is typically less than about 100 nt in length, often 
less than about 75 nt in length, and often less than about 50, 49, 48, 47, 46, 45, 44, 43, 42, 
41 or 40 nt in length, with lengths of about 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41*. 42, 
43, 44, 45, 46, 47, 48, 49, or 50 nt being typical. 

The first oligonucleotide includes a complementarity region that is perfectly 
complementary in sequence to a first strand of the target across the entirety of the target 
query region. 

The target query region is that region of the target as to which selectivity 
among polymorphic variants of the target is desired. It is defined by the first oligonucleotide 
and is that region of the target first strand that is perfectly complementary in sequence, in 
the Watson-Crick (as contrasted to Hoogstein) sense, to at least a portion of the sequence 
of the first oligonucleotide. 

The complementarity region of the first oligonucleotide (and by definition 
the target query region) is at least about 25, 26, 27, 28, 29, or 30 nucleotides in length may 
be at least about 35, 40, 45, or 50 nucleotides in length, and may even be longer, including 
75, 100, even 200, 300, 400, or 500 nt in length. Typically, the complementarity region is 
less than about 100 nt in length, often less than about 75 nt in length, and often less than 
about 50, 49, 48, 47, 46, 45, 44, 43, 42, 41 or 40 nt in length, with lengths of about 30 31 
32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42. 43, 44, 45, 46. 47, 48, 49, or 50 nt being typical." 

The first oligonucleotide may include regions additional to the 
complementarity region. 

As further discussed below, such regions may usefully be used, for 
example, to provide genotypic labels ("bar codes", or "tags") that uniquely identify the 
oligonucleotide; to provide priming sites for oligonucleotide-mediated amplification 
reactions; to provide phage promoter sites for generating RNA transcripts; and to provide 
restriction sites to facilitate subsequent cloning of the target query region. 

The second, or "annealing", oligonucleotide comprises base modifications 
and does not substantially bind the recombinase. The inability substantially to bind 
recombinase is typically, but need not invariably, be due to the presence of the modified 
bases. 
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Among base modifications usefully included in the second oligonucleotide 
are locked nucleic acid ("LNA") residues, ribonucleic acid residues, such as 2'0Me 
residues, or peptide nucleic acid (PNA) residues. Other modified bases that may be used 
include, for example, 2-aminoadenine and cytosine/uracil substituted at the 5 position with a 
methyl, propynyl or bromo group. 

LNAs are bicyclic and tricyclic nucleoside and nucleotide analogs and the 
oligonucleotides that contain such analogs. The basic structural and functional 
characteristics of LNAs and related analogues that usefully may be incorporated into the 
second ("annealing") oligonucleotide in the methods of the present invention are disclosed 
in various publications and patents, including WO 99/14226, WO 00/56748, WO 00/66604, 
WO 98/39352, U.S. Patent No. 6,043,060, and U.S. Patent No. 6,268,490, the disclosures 
of which are incorporated herein by reference in their entireties. 

Among such LNAs that may usefully be incorporated into the second 
oligonucleotide are those described by the following formula: 



15 




wherein X is selected from -0-, -S-, -N(R N >, -C(FW>, -0-C(R 7 R7>, -C(R 6 R 6 >0-, -S- 
C(R7R7>, -C(R6R6>S-, -N(R N >C(R 7 R 7 *)-, -C(R6R6>N(RN>, and -C(R6R6>C(RW>; 

20 B is selected from hydrogen, hydroxy, optionally substituted d-4-alkoxy, optionally 
substituted d-4-alkyl, optionally substituted C1-4-acyloxy, and the nucleobases; P 
designates an internucleoside linkage to an adjacent monomer, or a 5-terminal group, such 
internucleoside linkage or 5 -terminal group optionally including the substituent R 5 ; one of 
the substituents R 2 , R 2# , R 3 , and R 3 * is a group P* which designates an internucleoside 

2 5 linkage to an adjacent monomer, or a 3-terminal group; one or two pairs of non-geminal 
substituents selected from the present substituents of R 1 *, R*\ R$ R**, R6, R6* ( R7 pj\ rn^ 
and the ones of R 2 , R 2 *, R 3 , and R 3 * not designating P* each designates a covalent bridging 
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moiety consisting of one or more of the following substituents: -C(RaRU)-, -C(Ra)=C(Rb)-, - 
Cm = -0-, -Si(RaRb)-, -s-, -SQ2-, -N(R a )-, and > C=2, wherein Z is selected from -0-, 
S-, and -N(R 8 )-, and R* and R" each is independently selected from hydrogen, optionally 
substituted CWalkyl, optionally substituted CMz-alkenyl, optionally substituted C 2 - 12 - 
alkynyl, hydroxy, Ci.i2-alkoxy, C 2 -i2-alkenyloxy, carboxy, Ci.^alkoxycarbonyl, C1.12- 
alkylcarbonyl, formyl, aryl, aryloxycarbonyl, aryloxy, arylcarbonyl, heteroaryl, 
heteroaryloxycarbonyl, heteroaryloxy, heteroarylcarbonyl, amino, mono- and di-fC^- 
alkyl)amino, carbamoyl, mono- and dKCi^alkylJaminocarbonyl, amino-C«- 
alkylaminocarbonyl, mono- and di-(Ci^alkyl)amino-Ci^-alkylaminocarbonyl, C1* 
alkylcarbonylamino, carbamide, C^-alkanoyloxy, sulphono, Cu-alkylsulphonyloxy, nitro. 
azido, sulphanyl, Ce-alkylthio, and the halogens, where aryl and heteroaryl may be 
optionally substituted, and where two geminal substituents Ra and R° together may 
designate optionally substituted methylene ( =CH2), and wherein two non-geminal or 
geminal substituents selected from R*. Rb and any of the substituents Ri*. R 2 , Rr rs, r3-_ 
R 4 *, R 5 , R 5 *, R 6 and R6*. R>, and W which are present and not involved in P, P*, or the 
covalent bridging moiety or moieties together may form an associated bridging moiety 
selected from substituents of the same kind as defined before; 
the pair(s) of non-geminal substituents thereby forming a mono- or bicyclic entity together 
with (i) the atoms to which the non-geminal substituents are bound and (ii) any intervening 
atoms; and 

each of the substituents Ri*, R?, Rr, rs R 4-, R6> rs- R6 and R6 * R 7 and R r ^ are 
present and not involved in P, P*. or the covalent bridging moiety or moieties is 
independently selected from hydrogen, optionally substituted Cua-alkyl, optionally 
substituted C 2 .i2-alkenyl, optionally substituted C 2 -i 2 -alkynyl, hydroxy, C^alkoxy, Cm 2 - 
alkenyloxy, carboxy, Ci-i 2 -alkoxycarbonyl, Ci-i 2 -alkylcarbonyl, formyl, aryl, aryloxycarbonyl, 
aryloxy, arylcarbonyl, heteroaryl, heteroaryloxycarbonyl, heteroaryloxy, heteroarylcarbonyl, 
amino, mono- and di(Cu-alkyl)amino, carbamoyl, mono- and di(Ci*-alkyl)aminocarbonyl, 
amino-d^-alkylaminocarbonyl, mono- and di(C^-alkyl)amino-C^-alkylaminocarbonyl, C% 
6-alkylcarbonylamino, carbamido, d-s-alkanoyloxy, sulphono, d-e-alkylsulphonyloxy, nitro, 
azido, sulphanyl, d-e-alkylthio, and halogens, where aryl and heteroaryl may be optionally 
substituted, and where two geminal substituents together may designate oxo, thioxo, imino, 
or optionally substituted methylene, or together may form a spiro bridging moiety consisting 
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of a 1-5 carbon atom(s) alkylene chain which is optionally interrupted and/or terminated by 
one or more substituents selected from -0-, -S-, and -(NR N )- where R N is selected from 
hydrogen and Ci4-alkyl, and where two adjacent (non-geminal) substituents may designate 
an additional bond resulting in a double bond; 
5 and R N *, when present and not involved in a covalent bridging moiety, is selected from 
hydrogen and d-4-alkyl; and basic salts and acid addition salts thereof. 

In the above-described structures, the terms "nucleobase" and "base" 
cover naturally-occurring nucleobases as well as non-naturally occurring and modified 
nucleobases. As would be understood, various nucleobases which previously have been 

1 o considered "non-naturally occurring" have subsequently been found in nature. Thus, 

"nucleobase" or "base" include not only the known purine and pyrimidine heterocycles, but 
also heterocyclic analogs and tautomers thereof. Illustrative examples of nucleobases are 
adenine, guanine, thymine, cytosine, uracil, purine, xanthine, diaminopurine, 8-oxo-N 6 - 
methyladenine, 7-deazaxanthine, 7-deazaguanine, N 4 ,N 4 -ethanocytosine, N 6 ,N 6 -ethano-2,6- 
1 5 diaminopurine, 5-methylcytosine, 5-(C 3 -C 6 )-alkynylcytosine, 5-fluorouracil, 5-bromouracil, 
pseudoisocytosine, 2-hydroxy-S-methyl4-triazolopyridine, isocytosine, isoguanine, inosine 
and the "non-naturally occurring" nucleobases described in U.S. Patent No. 5,432,272, the 
disclosure of which is incorporated herein by reference in its entirety. The terms 
"nucleobase" and "base" are intended to cover each of these examples as well as analogs 

2 o and tautomers thereof. Especially useful nucleobases are adenine, guanine, thymine, 

cytosine, and uracil. 

As evident from the general formula above, and the definitions associated 
therewith, there may be one or several asymmetric carbon atoms present in the LNA- 
containing annealing oligonucleotides in the methods of the present invention, depending 
25 on the nature of the substituents and possible covalent bridging moieties. LNA-containing 
annealing oligonucleotides used in the present invention are intended to include all 
stereoisomers arising from the presence of any and all isomers of the individual monomer 
fragments as well as mixtures thereof, including racemic mixtures. Also included within the 
scope of the invention are variants of the general formula where B is in the a-configuration. 

30 In one series of useful embodiments, the second oligonucleotide may 

include LNAs such as those disclosed in WO 99/14226 and U.S. Patent No. 6,268,490, the 
disclosures of which are incorporated herein by reference in their entireties, which contain a 
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methylene bridge connecting the 2'-oxygen of the ribose with the 4'-carbon according to the 
following formula: 



x o 




where B is a nucleobase, and X and Y are internucleoside linkages. 

In other interesting embodiments of this LNA structure, the 2'-oxygen 
position is substituted with nitrogen or sulfur as shown in the following structures: 



x — o^. 



10 




x — o. 




15 



where B is a nucleobase, and X and Y are internucleoside linkages. 

Other embodiments of the basic LNA structure usefully included in the 
annealing oligonucleotides of the present invention are disclosed in WO 99/14226. In these 
embodiments, the covalent bridging moiety may include more than one carbon atom and 
may span other positions within the ribose ring according to the following structures: 
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where B is a nucleobase, and X and Y are internucleoside linkages. 

In yet other LNA alternatives, the annealing oligonucleotides may include at 
5 least one nucleoside having a xylo-LNA structure as disclosed in WO 00/56748, the 
disclosure of which is incorporated herein by reference in its entirety, and having the 
general formula: 




where the internucleoside linkages are designated by P and P\ and the other groups may 
be the substituents disclosed in WO 00/56748. Specific examples of this analog are 
disclosed in WO 00/50748 with the following structural framework: 




where B is a nucleobase, and X and Y are internucleoside linkages. Also disclosed in 
WO 00/56748 and considered within the scope of the current invention are nucleoside 
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analogs that contain linkages between the 2' and 5 1 carbons of the ribose ring: 




where B is a nucleobase, and X and Y are internucleoside linkages. 

Other embodiments of the annealing oligonucleotide may comprise at least 
one nucleoside having an L-Ribo-LNA structure as disclosed in WO 00/66604, the 
disclosure of which is incorporated herein by reference in its entirety, and having the 



general formula: 




Rr 

R 3 * R 2 * 



where the internucleoside linkages are designated by P and F, and the other groups may 
be the substituents disclosed in WO 00/66604. Specific examples of this analog are 
disclosed in WO 00/66604 with the following structural framework: 



>-JH 



B is a nucleobase, and X and Y are internucleoside linkages. 
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Yet other embodiments contain the nucleoside analogs disclosed in U.S. 
Patent No. 6,043,060, the disclosure of which is incorporated herein by reference in its 
entirety. These analogs are represented by monomer units of the general formula: 




where B is a pyrimidine or purine nucleic acid base, or a derivative thereof, and where, 
within an annealing oligonucleotide, the plurality of B substituents may be identical to or 
different from one another. 

1 o Synthesis of LNA nucleosides and nucleoside analogs and oligonucleotides 

that contain them may be performed as disclosed in WO 99/14226, WO 00/56748, 
WO 00/66604, WO 98/39352, U.S. Patent No. 6,043,060, and U.S. Patent No. 6,268,490. 
Many may now be ordered commercially (Exiqon, Inc., Vedbaek, Denmark; Proligo LLC, 
Boulder, CO, USA). 

1 5 The annealing oligonucleotide in the methods of the present invention may 

alternatively, or in addition, include ribonucleic acid residues. Among such residues, 2'OMe 
residues are conveniently used. 

The annealing oligonucleotide may also usefully include peptide nucleic 
acid (PNA) residues, oligonucleotide analogs in which the deoxyribose backbone of the 

2 o oligonucleotide is replaced by a peptide backbone, such as repeating units of N-(2- 

aminoethyl)glycine linked through amide bonds. 

Peptide nucleic acids and methods for their synthesis have by now been 
amply well described, and need not here be described in detail. For recent review, see 
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Nielsen etal., Curr. Issues. Mol. Biol. 1(1-2):89-104 (1999); Nielsen, Curr. Med. Chem. 
8(5):545-50 (2001); Nielsen, Curr. Opin. Biotechnol. 12(1):16-20 (2001); Good etal., 
Antisense Nucleic Acid Drug Dev. 7(4):431-7 (1997); Eriksson ef a/., Q. Rev. Biophys. 
29(4):369-94 (1996); Hyrup et al., Bioorg. Med. Chem. 4(1):5-23 (1996); Nielsen, Annu. 
Rev. Biophys. Biomol. Struct. 24:167-83 (1995); Nielsen etal., Bioconjug. Chem. 5(1):3-7 
(1994), the disclosures of which are incorporated herein by reference in their entireties. 
tBoc PNA monomers and reagents for use in PNA synthesis are available commercially 
from Applied Biosystems, Inc. (Foster City, CA, USA), which also provides custom PNA 
oligonucleotide synthesis services. 

The annealing oligonucleotides may also usefully include 2-aminoadenine 
and/or cytosine/uracff substituted at the 5 position with a methyl, propynyl or bromo group. 

The annealing oligonucleotide may be composed entirely of modified 
bases, or may instead comprise both modified and natural bases. The annealing oligo thus 
includes at least about 30%, 40%, 50%, even at least about 60%, 70%, 75%, 80%, 90%, 
95%, and even 100% modified bases. The lower limit for percentage inclusion of modified 
bases is determined, for any given annealing oligonucleotide sequence and composition, by 
the requirement that the oligonucleotide not bind substantial amounts of recombinase. The 
annealing oligonucleotide does not bind substantial amounts of recombinase if the 
annealing oligonucleotide, after incubation with RecA under conditions that favor filament 
formation, is alone incapable of mediating the formation of a single D-loop in the target 
query region. Typically, recombinase binding by the annealing oligonucleotide is 
undetectable. 

The second oligonucleotide is typically single-stranded, and is typically less 
than about 50 nt in length, often less than about 49, 48, 47, 46, 45, 44, 43, 42, 41, even no 
more than about 40 nt in length, and may be as short as 39, 38, 37, 36, 35, 34, 33, 32, 31 
even as short as 30, 29, 28, 27, 26, 25, 24, 23, 22, 21 or 20 nt in length. The second ' 
oligonucleotide may even be as short as 19, 18, 17, 16, or even as short as 15 nt in length. 
Annealing oligonucleotides of 15 nt are exemplified in the Examples herein below. 

The second oligonucleotide includes a complementarity region. 
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The second oligonucleotide complementarity region is perfectly 
complementary in sequence to a second strand of the target across at least a portion of the 
target query region. 

Typically, the second oligonucleotide complementarity region will be shorter 
5 than that of the first oligonucleotide, will not encompass regions of the target outside the 
target query area, and will typically be less than about 50 nt, 45 nt, 40 nt, 35 nt, 30 nt, 29 nt, 
28 nt, 27 nt, 26 nt, or even less than about 25 nt in length. The second oligonucleotide 
complementarity region will typically be at least about 15 nt in length, may be 16, 17, 18, 19, 
20, 21, 22, 23, 24, even at least about 25 nt in length, although longer regions of 

1 o complementarity are permissible. 

Because the first oligonucleotide is perfectly complementary in sequence to 
a first strand of the target across the entirety of the target query region, and the second 
oligonucleotide is perfectly complementary to a second strand of the target across at least a 
portion of the target query region, the first and second oligonucleotides have at least a 
15 region of perfect complementarity therebetween, also termed an "overlap region". The 
overlap region may extend the entirety of the second oligonucleotide complementarity 
region, or may instead include only a portion thereof. The overlap region is at least about 5 
nt, typically at least about 10 nt, and is often at least about 1 1 nt, 12 nt, 13 nt, 14 nt, 15 nt, 
and as long as about 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, even 50 nt. 

2 o The second oligonucleotide may include regions additional to the 

complementarity region. As further discussed below, such regions may usefully be used, 
for example, to provide genotypic labels ("bar codes", or "tags") that uniquely identify the 
oligonucleotide; to provide priming sites for oligonucleotide-mediated amplification 
reactions; to provide phage promoter sites for generating RNA transcripts; and to provide 

2 5 restriction sites to facilitate cloning of the target region. 

At least one of the oligonucleotide complementarity regions is imperfectly 
complementary to its respective strand of the query region of each of the target variants 
desired selectively to be discriminated from the target. By "imperfectly complementary" is 
intended at least one mismatch with respect to the respective strand of the target variant. 

3 0 As demonstrated in the Examples set forth herein below, a single base mismatch as 

between at least one of the first and second oligonucleotides, on the one hand, and the 
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target variant query region, on the other, causes discriminably decreased formation of 
deproteinization-stable double-D loops at nonsupercoiled variant query regions as 
compared to formation at the exactly matched nonsupercoiled target. Further mismatches 
can provide a greater difference in degree of formation. 

5 By "deproteinization-stable" is intended a degree of stability sufficient to 

permit Dloop or dDloop detection following deproteinization. 

In certain embodiments, therefore, the methods of this aspect of the 
present invention further include the additional step, after formation of single or double ID- 
loops, and typically (but not invariably) before detection, of deproteinizing the nucleic acids 
io in the sample. 

Deproteinization may be accomplished, for example, by treatment with 
SDS or proteinase K, as well as by standard chemical deproteinization methods, such as 
phenol-based methods, at temperatures up to 25°C, up to 30°C, 35°C, 37°C, at times up to 
40°C, 41 °C, 42°C, 43°C, 44°C, or 45°C, and even at temperatures as high as 46°C, 47°C, 
15 48<C, 49°C, 50°C, 51°C, 52°C, 53«C, 54°C, or 55°C, and on occasion at temperatures as' 
high as 56°C, 57»C, 58°C, 59°C, even 60°C, for times up to 1 , 2, 2.5, 3, 4, or 5 minutes, 
even as long as 6, 7, 8, 9 or 10 minutes, although the shorter incubation times are more 
typical. 

Depending upon the length and sequence of the complementarity regions, 
2 o the base composition of the annealing oligonucleotide, the nature of the recombinase, and 
other factors, the single or double D-loops at the target query region are stable after 
deproteinization for periods of up to 1, 2, 2.5, 3, 4, or5 minutes, even as long as 6, 7,8, 9 
or even 10 minutes, at temperatures as high as 20°C, 25°C, up to 30°C, 35 6 C, 37°C, at 
times up to 40°C, 41 °C, 42°C, 43°C, 44°C, or 45'C, and even at temperatures as high as 
25 46°C, 47°C, 48°C, 49°C, 50°C, 51°C, 52°C, 53=C, 54«C, or 55°C, and on occasion at 

temperatures as high as 56°C, 57°C, 58°C, 59°C, even 60°C. The deproteinized Dloops or 
dDloops are stable for longer periods, up to 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 
hours, even up to 10-18 hours or more at temperatures no higher than 4°C, and may be 
stable for periods longer than 1 day at temperatures of -20 e C or below. 
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In the single D-loop methods of the present invention, the first 
oligonucleotide is detectable. In the double D-loop methods of the present invention, 
whether for distinguishing supercoiled or nonsupercoiled targets, at least one of the first and 
second oligonucleotides is detectable, optionally both. 

5 In embodiments in which both first and second oligonucleotides are 

detectable, the first and second oligonucleotides may be distinguishably detectable with 
respect to one another. In embodiments in which a plurality of first and second 
oligonucleotide pairs are used, further described below, each first oligonucleotide may be 
distinguishably detectable from the others of the first oligonucleotides, and each second 

1 o oligonucleotide may be distinguishably detectable from the others of the second 

oligonucleotides. In these latter embodiments, the first and second oligonucleotides of each 
separate pair may or may not be distinguishable therebetween. 

In a variety of embodiments, either or both of the first and second 
oligonucleotides may include at least one label, which can be directly or indirectly 
15 detectable. 

Detectable labels include, e.g., a radionuclide, a fluorophore, a 
fluorescence resonance energy transfer ("FRET") tandem fluorophore, a FRET donor 
and/or acceptor, or a mass tag. Indirectly detectable labels include, e.g., an enzyme, a 
genotypic label, or a hapten. 

20 The label may, for example, be a radionuclide, such as 33 P, 32 P, ^S, and 

3H. 

The label may instead be a fluorophore. Commercially available 
fluorescent nucleotide analogues readily incorporated into the first and/or second 
oligonucleotides include, for example, Cy3-dCTP, Cy3-dUTP, Cy5-dCTP, Cy3-dUTP 

2 5 (Amersham Biosciences, Piscataway, New Jersey, USA), fluorescein-1 2-dUTP, 

tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade Blue®-7-dUTP, BODIPY® 
FL-14-dUTP, BODIPY® ™R-14-dUTP, BODIPY® TR-14-dUTP, Rhodamine 
Green™-5-dUTP, Oregon Green® 488-5-dUTP, Texas Red®-12-dUTP, BODIPY® 
630/650-14-dUTP, BODIPY® 650/665-1 4-dUTP, Alexa Fluor® 488-5-dUTP, Alexa Fluor® 

3 o 532-5-dUTP, Alexa Fluor® 568-5-dUTP, Alexa Fluor® 594-5-dUTP, Alexa Fluor® 

546-14-dllTP, fluorescein-12-UTP, tetramethylrhodamine-6-UTP, Texas Red®-5-UTP, 
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Cascade Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-14-UTP, BODIPY® 
TR-14-UTP, Rhodamine Green™- 5 .UTP, Alexa Fluor® 488-6-UTP, Alexa Fluor® 
546-14-UTP (Molecular Probes, Inc. Eugene, OR, USA). Protocols are available for custom 
synthesis of nucleotides having other fluorophores. Henegariu etai, "Custom Fluorescent- 
Nucleotide Synthesis as an Alternative Method for Nucleic Acid Labeling." Nature 
Biotechnol. 18:345 - 348 (2000), the disclosure of which is incorporated herein by reference 



Other fluorophores available for post-synthetic attachment include, inter 
alia, Alexa Fluor® 350, Alexa Fluor® 532, Alexa Fluor® 568, Alexa Fluor® 594, Alexa 
Fluo,® 647, BODIPY 493/503, BODIPY FL, BODIPY R6G, BODIPY 530/550, BODIPY 
TMR, BODIPY 558/568, BODIPY 558/568, BODIPY 564/570, BODIPY 576/589 BODIPY 
581/591, BODIPY 630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl 
lissamine rhodamine B, Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue, 
rhodamine 6G, rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red 
(available from Molecular Probes, Inc., Eugene, OR, USA), and Cy2, Cy3.5, Cy5.5, and Cy7 
(Amersham Biosciences, Piscataway, NJ USA, and others). 

FRET tandem fluorophores may also be used, such as PerCP-Cy5 5 PE- 
Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, and APC-Cy7. 

Pairs of individual fluorophores that can participate in FRET may also be 
used, with the two fluorophores of the pair present respectively on first and second 
oligonucleotides or present together on either or both of the first and second 
oligonucleotides. 

Such FRET fluorophore pairs are well known in the art and include, e.g., 
fluorescein isothiocyanate (FITC) and tetramethylmodamine isothiocyanate (TRITC); 
FITC/Texas Red™ (Molecular Probes, Inc.); FITC/N-hydroxysuccinimidyl 1-pyrenebutyrate 
(PYB); FITC/eosin isothiocyanate (EITC); FITC/tetramethylmodamine (TAMRA); and 
various Alexa Fluor pairs (Molecular Probes, Inc.), such as Alexa Fluor 488 and any one of 
Alex Fluors 546, 555, 568, 594, or 647. 

FRET donors may also be paired with quenchers, with the donor and 
quencher of the pair present respectively on first and second oligonucleotides, or present 
together on either or both of the first and second oligonucleotides. 



WO 03/027640 



34 



PCT/US02/31073 



Suitable fluorophore-quencher pairs are well known in the art and include, 
e.g., FAM, HEX, TAMRA, EDANS and Texas Red™ as fluorophore, with dabcyl useful as a 
common quencher. Other quenchers include, e.g., QSY 7, QSY 9, QSY 21 and QSY 35 (all 
from Molecular Probes, Inc., Eugene, OR); gold nanoparticles (Dubertret ef a/., Nature 
5 Biotechnol. 1 9:365-370 (2001 )), and other metals and metalloids, both as macroscopic 
solids and nanoparticles, including particles in colloid suspension. 

Labels that are detectable by mass spectrometry, "mass tags", may also be 
used. Mass tags can be designed to provide hundreds of mass spectrally distinguishable 
species, allowing highly multiplexed reactions, and can be designed to be cleavable, 

1 o typically photochemically cleavable, from the labeled nucleic acid, simplifying analysis. See, 

e.g., Kokoris ef a/., Moi Diagn. 5(4):32940 (2000) and Pusch ef a/., Pharmacogenomics 
3(4):537-48 (2002), the disclosures of which are incorporated herein by reference in their 
entireties. 

At least one oligonucleotide, optionally both, may instead or in addition 
1 5 include at least one indirectly detectable label. 

For example, the oligonucleotide may include an enzyme. 

Enzymes useful for colorimetric detection are well known, and include 
alkaline phosphatase, p-galactosidase, glucose oxidase, horseradish peroxidase (HRP), 
and urease. Typical substrates for production and deposition of visually detectable products 

2 o include o-nitrophenyl-beta-D-galactopyranoside (ONPG); o-phenylenediamine 

dihydrochloride (OPD); p-nitrophenyl phosphate (PNPP); p-nitrophenyl-beta-D- 
galactopyranoside (PNPG); 3',3'-diaminobenzidine (DAB); 3-amino-9-ethylcarbazole (AEC); 
4-chloro-1-naphthol (CN); 5-bromo-4-chloro-3-indolyl-phosphate (BCIP); ABTS®; BluoGal; 
iodonitrotetrazolium (INT); nitroblue tetrazolium chloride (NBT); phenazine methosulfate 

2 5 (PMS); phenolphthalein monophosphate (PMP); tetramethyl benzidine (TMB); tetranitroblue 

tetrazolium (TNBT); X-Gal; X-Gluc; and X-Glucoside. 

Enzymes may also be used for luminescent detection. Luminescent labels, 
such as enhanced chemiluminescence labels, are well known in the art. 

For example, in the presence of hydrogen peroxide (H202), horseradish 

3 o peroxidase (HRP) can catalyze the oxidation of cyclic diacylhydrazides, such as luminol, 

with subsequent light emission. Strong enhancement of the light emission can be produced 
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by enhancers, such as phenolic compounds. Thorpe ef a/., Methods Enzymol. 133:331-53 
(1986); Kricka et a/., J. Immunoassay 17(1):67-83 (1996); and Lundqvist ef a/., J. Biolumin. 
Chemilumin. 10(6):353-9 (1995), the disclosures of which are incorporated herein by 
reference in their entireties. Kits for such chemiluminescent and enhanced 
chemiluminescent detection of nucleic acids are available commercially. 

Either or both of the first and second oligonucleotides may include a 
"genotypic label" - variously termed a "bar code tag" or "tag" in the art - which are short 
sequences designed algorithmically to maximize discrimination on a micraarray having 
complements of the respective tags; a 1:1 correspondence as between tag sequence and 
first and/or second oligonucleotide permits each oligonucleotide so labeled to be detected 
by detection of the barcode uniquely associated therewith. See, e.g., Shoemaker etal., 
Nature Genet. 14(4):450-6 (1996); EP 0799897; Fan etal., Genome Res. 10:853-60 (2000); 
and U.S. Patent No. 6,150,516, the disclosures of which are incorporated herein by 
reference in their entireties. 

The oligonucleotides can, in addition or in the alternative, include a 
detectable hapten, typically a hapten that is indirectly detectable. Haptens that are 
commonly conjugated to nucleotides for incorporation into oligonucleotides include biotin 
(biotin-11-dUTP, Molecular Probes, Inc., Eugene, OR, USA; biotin-21-UTP, biotin-21-dUTP, 
Clontech Laboratories, Inc., Palo Alto, CA, USA), digoxigenin (DIG-1 1-dUTP, alkali labile, 
DIG-11-UTP, Roche Diagnostics Corp., Indianapolis, IN, USA), and dinitrophenyl 
(dinitrophenyl-1 1-dUTP, Molecular Probes, Inc., Eugene, OR, USA). 

Depending upon the choice of label, deproteinization-stable single or 
double D-loops may be detected in the methods of the invention by detecting radiation, as 
by autoradiography or phosphorimaging; by detecting fluorescence, including standard 
fluorescence, fluorescence resonance energy transfer, fluorescence polarization, and time- 
resolved fluorescence; colorimetrically; by mass spectrometry, typically time-of-flight mass 
spectrometry; or luminescently. 

In distinguishing supercoiled targets from target variants, we have 
observed increased selectivity under certain conditions by further contacting the sample 
with a third oligonucleotide. 
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Thus, in certain embodiments, the methods further comprise contacting the 
sample with a third oligonucleotide, either contemporaneously with or subsequent to contact 
with the first and/or second oligonucleotide. 

The third oligonucleotide is typically designed essentially according to the 
5 criteria described above for design of the second oligonucleotide, which description is 
incorporated here by reference, except that the third oligonucleotide includes a 
complementarity region that is perfectly complementary in sequence to at least a portion of 
the second strand of the query region of a target variant as to which the target is desired to 
be discriminated, and imperfectly complementary in sequence to at least a portion of the 

1 o query region of the target itself. 

The third oligonucleotide may, but need not, be detectable. If detectable, it 
is distinguishably detectable from the second oligonucleotide, and optionally from the first 
oligonucleotide. If detectable, the third oligonucleotide may be rendered detectable by any 
of the approaches above-described for the first and/or second oligonucleotides, which 
15 description is incorporated here by reference in its entirety, 

In the methods of the first aspect of the present invention, the nucleic acid 
target and the variants to be distinguished therefrom are present in the nucleic acid sample 
in topologic forms other than supercoils. In the methods of the second aspect of the present 
invention, the nucleic acid target and variants to be distinguished therefrom are supercoiled. 

2 o Nonsupercoiled target and variants may, for example, be linear duplexes, 

partially duplexed linear molecules, or nonsupercoiled circles, such as nicked circles or 
relaxed covalently closed circles. Linear or partially linearized samples are characteristic, 
for example, of cellular genomic DNA preparations from eukaryotes, artificial chromosomes, 
such as YACs, PCR products, and various viral preparations. Nonsupercoiled circles are 

2 5 characteristic, for example, of plasmid, cosmid, or BAG preparations that have been 

relaxed, either by nicking (as by nucleases, chemical treatment, UV exposure, or physical 
shearing) or by treatment with a topoisomerase. Supercoiled target and variants typically 
will be present within plasmids or cosmids. 

Both nonsupercoiled and supercoiled samples may usefully include 

3 o genomic DNA targets. 
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Genomic DNA samples may be derived directly from cellular chromosomes 
or may instead be derived from recombinant sources. Samples derived directly from 
cellular chromosomes lack vector sequences and will typically be in nonsupercoiled form; 
samples derived from recombinant sources include vector sequences in addition to the 
genomic target and may be supercoiled. Cellular genomic samples may include DNA in 
aqueous composition, or may instead include DNA in desiccated or fixed foim as in 
chromosomal spreads typically used for karyotyping. Recombinant genomic samples will 
typically be in aqueous composition. 

Nonsupercoiled recombinant genomic targets usefully discriminated from 
variants in the methods of the present invention include genomic targets present within 
artificial chromosomes, such as YACs (yeast artificial chromosomes), BACs (bacterial 
artificial chromosomes), PACs (P-1 derived artificial chromosomes), HACs (human artificial 
chromosomes), and PLACs (plant artificial chromosomes). Artificial chromosomes are 
reviewed in Larin etal, Trends Genet. 18(6):313-9 (2002); Choi etal., Methods Mol. Biol. 
175:57-68 (2001); Brune etal., Trends Genet. 16(6):254-9 (2001); Ascenzioni ef at, Cancer 
Lett. 1 18(2):135-42 (1997); Fabb ef a/., Mol. Cell. Biol. Hum. Dis. Ser. 5:104-24 (1995); 
Huxley, Gene Ther. 1(1):7-12 (1994), the disclosures of which are incorporated herein by 
reference in their entireties. Other vectors capable of including genomic DNA in 
nonsupercoiled form include various viral, typically eukaryotic viral, vectors, such as 
adenoviral, varicella, and herpesvirus vectors. 

In certain useful embodiments, genomic targets to be distinguished from 
their respective variants are present within vectors that permit integration of the target into a 
cellular chromosome. In particularly useful embodiments, genomic targets are present 
within vectors that permit site-directed integration of the target into a cellular chromosome. 
Usefully, the vector is an artificial chromosome and site-specific integration may be 
performed by recombinase mediated cassette exchange (RMCE). 

In RMCE, a region of DNA (cassette) desired to be integrated into a 
specific cellular chromosomal location is flanked in a recombinant vector by sites that are 
recognized by a site-specific recombinase, such as loxP sites and derivatives thereof for 
Cre recombinase and FRfsites and derivatives thereof for Flp recombinase. Other site- 
specific recombinases having cognate recognition/recombination sites useful in such 
methods are known (see, e.g., Blake etal., Mol Microbiol 23(2):387-98 (1997)). 
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The site in the cellular chromosome into which the cassette is desired site- 
specifically to be integrated is analogously flanked by recognition sites for the same 
recombinase. 

To favor a double-reciprocal crossover exchange reaction between vector 
5 and chromosome, two approaches are typical. In the first, the two sites (such as fox or 
FRT) that flank the cassettes in both vector and cellular chromosome are heterospecific: 
that is, they differ from one another and recombine with each other with far lower efficiency 
than with sites identical to themselves. In the second, the fox or FRT sites are inverted. 
See, e.g., Baer ef a/., Curr. Opin. Biotechnol. 12:473480 (2001); Langer ef a/., Nucl. Acids 
10 Res. 30:3067-3077 (2002); Feng ef a/., J. Mol. Biol. 292:779-785 (1999), the disclosures of 
which are incorporated herein by reference in their entireties. 

Recombinational exchange of the cassettes from vector to cellular 
chromosome, with integration of the construct cassette site-specifically into the cellular 
chromosome, is effected by introducing the recombinant construct into the cell and 
1 5 expressing the site-specific recombinase appropriate to the recombination sites used. The 
site-specific recombinase may be expressed transiently or continuously, either from an 
episome or from a construct integrated into cellular chromosome, using techniques well 
known in the art. 

In such embodiments, in which genomic targets to be distinguished from 

2 o their respective variants are present within vectors that permit site-specific integration into 

chromosomes, the methods of the present invention can be used to detect and discriminate 
those vectors that include targets desired to be integrated site-specifically into a cellular 
chromosome, with selectivity sufficient to distinguish the genomic target from variants that 
differ from it by as few as one nucleotide. 
25 As described in further detail below, the methods of the present invention 

further permit the separation and purification of targets from variants thereof, and also from 
unrelated targets. In embodiments in which genomic targets to be distinguished from their 
respective variants (or other targets) are present within vectors that permit site-specific 
integration into chromosomes, the methods of the present invention can be used to 

3 o separate and purify those vectors that include targets desired to be integrated site- 
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specifically into a cellular chromosome, with selectivity sufficient to isolate the genomic 
target from variants that differ from it by as few as one nucleotide. 

Such embodiments may usefully be combined with methods for targeting 
nucleic acid sequence changes to the genomic targets present within such vectors. 

We have recently described methods for targeting single nucleotide 
changes directly into long pieces of genomic DNA present within YACs, BACs, and even 
intact cellular chromosomes through use of sequence-altering oligonucleotides. See 
international patent publication nos. WO 01/73002, WO 01/92512, and WO 02/10364; and 
commonly owned and copending U.S. provisional patent application nos. 60/326,041, filed 
September 27, 2001, 60/337,129, filed December 4, 2001, 60/393,330, filed July 1. 2002, 
60/363,341 , filed March 7, 2002; 60/363,053, filed March 7, 2002, and 60/363,054, filed ' 
March 7, 2002, the disclosures of which are incorporated herein by reference in their 
entireties. 

Other approaches for targeting sequence changes using sequence altering 
oligonucleotides have also been described. See e.g. U.S. Patent Nos. 6,303,376; 
5,776,744; 6,200,812; 6,074,853; 5,948,653; 6,136,601; 6,010,907; 5,888,983; 5,871,984; 
5,760,012; 5,756,325; and 5.565,350, the disclosures of which are incorporated herein by' 
reference in their entireties. 

Using such targeting methods, small changes, including single nucleotide 
changes, can be targeted to genomic targets that are present within recombinant vectors. 
The methods of the present invention can then be used to identify, quantify, separate, and 
purify the successfully targeted constructs. If the target is present in a vector that permits 
the subsequent site-specific integration of the target into a cellular chromosome, the 
successfully targeted variant can then be integrated site-specifically into a cellular 
chromosome. 

This approach provides a single-copy integrant of defined and chosen 
sequence in a defined cellular genomic milieu. It is known that such site-specific integration 
provides more consistent expression than does random integration. Feng etal., J. Mol. 
Biol. 292:779-285 (1999). 

We have recently described the use of oligonucleotide-mediated sequence 
alteration in the creation of collections of coisogenic eukaryotic cell lines- collections of 
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genotypically distinct cells, derived from a common ancestor cell, that are engineered to 
differ from one another in genomic sequence at a predetermined target locus. Such 
coisogenic culture collections find utility, for example, in pharmacogenomics applications. 
See commonly owned and copending U.S. provisional patent application no. 60/325,992, 
5 filed September 27, 2001 , the disclosure of which is incorporated herein by reference in its 
entirety. 

The methods of the present invention can be used to facilitate creation of 
such coisogenic cell collections. 

With genomic samples, whether obtained directly from cellular 

1 o chromosomes or within recombinant vectors, the query region may be within the coding 

region of a gene, within an intergenic region, including promoter or enhancer elements, 
within an intron if the genomic sample is eukaryotic, or within the 5' or 3' UTR of a gene. In 
recombinant genomic samples, the target query region may be within a vector sequence. 

In another series of embodiments, the nucleic acid sample comprises 
1 5 cDNA, either in nonsupercoiled topologies - e.g. as a cDN A population lacking replicable 
vector sequences, as from an RT-PCR or cDNA preparation prior to ligation into a 
vector - or in supercoiled form, as in a plasmid cDNA library. 

The sample may contain as little as 100 pg of target nucleic acid, typically 
no less than 1 ng of target nucleic acid, often 10 ng - 100 ng of target nucleic acid, and may 

2 o include more, including 200 ng, 300 ng, 400 ng, 500 ng of target DNA. 

The nucleic acids of the sample to be queried may be derived from a single 
individual or pooled from a plurality of individuals. The plurality may include as few as two 
individuals, and may include as many as 3, 4, 5, 10, 20, 30, 40, 50, 75, or even as many as 
100 individuals. 

2 5 The nucleic acids may be derived from a body fluid, either a fluid having 

formed cellular elements that include nucleic acids or a fluid that lacks formed elements but 
otherwise includes nucleic acids, including blood, urine, sweat, saliva, sputum, semen, 
vaginal secretions, cerebrospinal fluid, lymph, breast nipple aspirate, pus, aqueous humor, 
vitreous humor, and amniotic fluid. The nucleic acids may be derived from a surgical 

3 o sample, such as a biopsy, aspirate, or lavage. The nucleic acids may also usefully be 

derived from stool or scat. 
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In yet other embodiments, the sample may be drawn from cultured cells. 

The nucleic acid samples and the targets therein may be drawn from viral, 
prokaryotic or eukaryotic sources. 

Among viruses, targets that may usefully be discriminated from variants 
5 using the methods of the present invention include targets present in double-stranded DNA 
viruses, such as herpesviruses, including human herpesvirus 1 and 2 (HSV-1 and HSV-2), 
varicella-zoster (HSV-3), cytomegalovirus (HCMV), human herpesvirus 6, 7, 8 (HHV-6, 
HHV-7, HHV-8), or Epstein-Barr virus (EBV). 

Other useful viral targets include double-stranded forms derived from 
retroviruses, such as provirus integrants or double-stranded cDNAs prepared synthetically 
from retrovirus RNA. Retroviruses having targets that may usefully be discriminated from 
variants using the methods of the present invention include, for example, mammalian type B 
retroviruses, such as mouse mammary tumor virus; mammalian type C retroviruses, such 
as murine leukemia virus and reticuloendotheliosis virus (strain T, A); avian type C 
retroviruses such as avian leukosis virus; type D retroviruses such as Mason-Pfizer monkey 
virus; BLV-HTLV retroviruses such as bovine leukemia virus; lentiviruses, such as bovine 
lentiviruses including bovine immunodeficiency virus, feline immunodeficiency virus, 
visna/maedi virus (strain 1514), and primate lentiviruses such as human immunodeficiency 
virus 1 (HIV1), human immunodeficiency virus 2 (HIV2), and simian immunodeficiency virus 
(SIV). 

Among prokaryotes, the nucleic acid sample used in the, methods of the 
present invention may usefully be drawn from eubacteria, including gram negative and gram 
positive bacteria, including E. coll 

Among eukaryotes, the nucleic acid sample and target may usefully be 
drawn from yeasts, such as S. cerevisiae, Schizosaccharomyces pombe, Ustillago maydis, 
Neurospora crassa and Candida albicans; mammals, such as primates, including humans, 
monkeys, and apes, small laboratory animals, such as rodents, including mouse or rat, 
guinea pigs, and rabbits, and livestock, such as cows, horses, chickens, goats, and sheep; 
plants, such as angiosperms, gymnosperms, and mosses, including Chlamydomonas 
reinhardtii, Physcomitrella patens, and Arabidopsis thaliana, and crop plants such as 
cauliflower (Brassica oleracea), artichoke (Cynara scolymus), fruits such as apples (Malus, 
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e.g. domesticus), mangoes [Mangifera, e.g. indica), banana [Musa, e.g. acuminata), berries 
(such as currant, Ribes, e.g. rubrum), kiwifruit [Actinidia, e.g. chinensis), grapes (Wf/s, e.g. 
wn/fera), bell peppers [Capsicum, e.g. annuum), cherries (such as the sweet cherry, 
Prunus, e.g. avium), cucumber [Cucumis, e.g. sativus), melons [Cucumis, e.g. me/o), nuts 
5 (such as walnut, Juglans, e.g. reg/a; peanut, Arachis hypogeae), orange (C/fn/s, e.g. 
maxima), peach (Prunus, e.g. pere/ca), pear [Pyra, e.g. communis), plum (Pmnus, e.g. 
domestical strawberry {Fragaria, e.g. moschata or vesca), tomato [Lycopersicon, e.g. 
esculentum)', leaves and forage, such as alfalfa [Medicago, e.g. saf/Va or truncatula), 
cabbage (e.g. Brass/ca oleracea), endive (Cichoreum, e.g. end/V/a), leek (A///um, e.g. 

1 o poram), lettuce (Lacfuca, e.g. saf/va), spinach (Spinacia, e.g. o/eraceae), tobacco 

[Nicotiana, e.g. tabacum)] roots, such as arrowroot [Maranta, e.g. arundinacea), beet (Befa, 
e.g. vulgaris), carrot (Daucus, e.g. carofa), cassava [Manihot, e.g. esculenta), turnip 
(Brass/ca, e.g. rapa), radish (Raphanus, e.g. saf/Vus), yam [Dioscorea, e.g. esculenta), 
sweet potato (Ipomoea batatas); seeds, including oilseeds, such as beans [Phaseolus, e.g. 
1 5 w/gar/s), pea (P/sum, e.g. saf/Vum), soybean (G/yc/ne, e.g. max), cowpea (Wgna 
unguiculata), mothbean ( Vigna aconitifolia), wheat {Triticum, e.g. aestivum), sorghum 
(Sorghum e.g. b/co/or), barley [Hordeum, e.g. w/gare), corn (Zea, e.g. mays), rice (O^yza, 
e.g. saf/Va), rapeseed (Brassica napus), millet {Panicum sp.), sunflower (Helianthus 
annuus), oats (Avena saf/Va), chickpea (C/cer, e.g. arietinum)', tubers, such as kohlrabi 

2 o [Brassica, e.g. o/eraceae), potato [Solanum, e.g. tuberosum) and the like; fiber and wood 

plants, such as flax [Linum e.g. usitatissimum), cotton [Gossypium e.g. hirsutum), pine 
(P/nus sp.), oak [Quercus sp.), eucalyptus [Eucalyptus sp.), and the like and ornamental 
plants such as turfgrass (Lo//um, e.g. rigidum), petunia (Pefun/a, e.g. x hybrida), hyacinth 
[Hyacinthusorientalis), carnation [Dianthuse.g. caryophyllus), delphinium [Delphinium, e.g. 

2 5 a/ac/s), Job's tears (Co/x lacryma-jobi), snapdragon [Antirrhinum majus), poppy [Papaver, 

e.g. nudicaule), lilac (Syr/nga, e.g. vu/gar/s), hydrangea [Hydrangea e.g. macrophylla), 
roses (including Gallicas, Albas, Damasks, Damask Perpetuals, Centifolias, Chinas, Teas 
and Hybrid Teas) and ornamental goldenrods (e.g. Solidago spp.). 

Useful targets derived from human nucleic acids include alleles known or 

3 o suspected to contribute to disease, to disease predisposition, or to responsiveness to 

therapeutic agents. 
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A large and increasing number of alleles of human genes that contribute to 
or predispose to disease have been identified. Such alleles are catalogued, inter alia, in the 
Human Gene Mutation Database (http://archive.uwcm.ac.uk/uwcm/ 

mg/hgmd0.html) and the Online Mendelian Inheritance of Man (OMIM™) 
(http://www.ncbi.nlm.nih.gov/entrez/ 

query.fcgi?db=OMIM). The methods of the present invention may be used to identify the 
presence of such alleles in a patient sample. 

Human genomic samples may also usefully be queried using the methods 
of the present invention for alleles known to affect responsiveness to therapeutic agents. 

Genetic polymorphisms in proteins, including the multidrug transporters 
and cytochromes, are known to play a role in drug sensitivity and in drug resistance. For 
example, the cytochrome P450 enzyme encoded by CYP2D6 is known to metabolize as 
many as 20% of commonly prescribed drugs. The enzyme's substrates include 
debrisoquine, an adrenergic-blocking drug; sparteine and propafenone, both anti-arrhythmic 
drugs; and amitryptiline, an anti-depressant. The gene is highly polymorphic in the 
population; certain alleles result in the poor metabolizer phenotype, characterized by a 
decreased ability to metabolize the enzyme's substrates. 

Thus, in one series of embodiments, the targets to be detected by the 
methods of the present invention are alleles of genes that affect drug metabolism, including: 
CYP1A2, CYP2C17, CYP2D6, CYP2E, CYP3A4, CYP4A11, CYP1B1, CYP1A1, CYP2A6, 
CYP2A13, CYP2B6, CYP2C8, CYP2C9, CYP11A, CYP2C19, CYP2F1, CYP2J2, CYP3A5, 
CYP3A7, CYP4B1, CYP4F2, CYP4F3, CYP6D1, CYP6F1, CYP7A1, CYP8, CYP1 1A, 
CYP11B1, CYP11B2 , CYP17, CYP19, CYP21A2, CYP24, CYP27A1, CYP51, ABCB1, 
ABCB4, ABCC1, ABCC2, ABCC3. ABCC4, ABCC5, ABCC6, MRP7, ABCC8, ABCC9, 
ABCC10, ABCC11, ABCC12, EPHX1, EPHX2, LTA4H, TRAG3, GUSB, TMPT, BCRP, 
HERG, hKCNE2, UDP glucuronosyl transferase (UGT), sulfotransferase, sulfatase, 
glutathione S-transferase (GST) -alpha, glutathione S-transferase -mu, glutathione S- 
transferase -pi, ACE, and KCHN2. 

In other embodiments, the targets can be alleles that affect responsiveness 
to a single drug, or a single class of drugs. For example, the targets can be alleles of 
genes, or variants of gene fusions, such as the BCR-ABL gene fusion, that affect 
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responsiveness to tyrosine kinase inhibitors, such as imatinib (Gleevec). Resistance- 
conferring alleles of BCR-ABL are disclosed in Shat ef a/., Cancer Cell 2:1 17-125 (2002), 
the disclosure of which is incorporated herein by reference in its entirety. 

In some embodiments, the sample includes at least one target variant in 
5 addition to the target desired selectively to be detected. 

' In certain of these embodiments, the target variant differs from the target by 
as few as one nucleotide in the common query region, and up to 2, 3, 4, 5, 6, 7, 8, 9, even 
10 nucleotides in the common query region. Frequently, the variant differs from the target in 
the common query region by no more than 9, 8, 7, 6 and even by no more than 5 

1 o nucleotides. In some embodiments, the sample includes at least one target variant in 

addition to the target desired selectively to be detected, the target and variant differing in the 
common query region by 1, 2, 3, 4, or 5 nucleotides. 

The variant may be a naturally-occurring allelic variant, a separate member 
of a gene family, an orthologue from a heterologous species, or a recombinant^ engineered 
15 variant. 

In certain embodiments, the sample includes a plurality of variants in 
addition to the target desired selectively to be detected. Each of the plurality of variants 
differs from the target by as few as one nucleotide in the query region that is common 
therebetween, and up to 2, 3, 4, 5, 6, 7, 8, 9, even 10 nucleotides in the query region that is 

2 o common therebetween. Frequently, each of the plurality of variants differs from the target in 

the common query region by no more than 9, 8, 7, 6 and even by no more than 5 
nucleotides. In some embodiments, the sample includes a plurality of target variants in 
addition to the target desired selectively to be detected, the target and each of the variants 
differing in a query region common therebetween by 1, 2, 3, 4, or 5 nucleotides. 

2 5 The methods of the present invention may further comprise a step in which 

single D-loop- or double D-loop-containing targets are separated from other nucleic acids in 
the sample. Such an optional separation step may be performed after an optional 
deproteinization step, and either before or after detection, with separation after 
deproteinization and before detection exemplified in Examples set forth herein below. 

3 o Separation may be based upon inherent physical properties of D-loops or 

dDloops that permit duplexes containing D-loops or dDloops to be distinguished from 
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duplexes lacking such structures. Among such properties is altered mobility during 
electrophoresis, such as gel electrophoresis or capillary electrophoresis. 

Separation may, in the alternative or in addition, be based upon properties 
that are engineered into the Dloops or dDloops. 

5 For example, in certain embodiments, either or both of the first and second 

oligonucleotides may include a moiety that permits subsequent capture, and thus 
separation, of the D-loop ordDloop-containing duplexed DNA. 

The capture moiety is typically one member of a specific binding pair. 

"Specific binding™ refers to the ability of two molecular species concurrently 
present in a heterogeneous (inhomogeneous) sample to bind to one another in preference 
to binding to other molecular species in the sample. Typically, a specific binding interaction 
will discriminate over adventitious binding interactions in the reaction by at least two-fold, 
more typically by at least 10-fold, often at least 100-fold; when used to detect analyte, 
specific binding is sufficiently discriminatory when determinative of the presence of the 
analyte in a heterogeneous (inhomogeneous) sample. Typically, the affinity or avidity of a 
specific binding reaction is least about 10 7 M-\ using at least 10 8 M- 1 to at least about 10 9 M- 
1 , and often greater, including affinities or avidities up to 10 10 WM to 10 12 M- 1 . 

The phrase "specific binding pair" refers to pairs of molecules, typically 
pairs of biomolecules, that exhibit specific binding. 

A wide range of specific binding pair members that can be used for capture 
of oligonucleotides are known in the art, 

Among these are small capture moieties colloquially termed "haptens" 
irrespective of their antigenicity. Such haptens include biotin, digoxigenin, and 
dinitrophenyl, each of which may be incorporated enzymatically through use of prior- 
conjugated nucleotides: biotin-11-dUTP (Molecular Probes, Inc., Eugene, OR, USA); 
biotin-21-UTP, biotin-21-dUTP (Clontech Laboratories, Inc., Palo Alto, CA, USA), 
digoxigenin-11-dUTP, alkali labile, digoxigenin-1 1-UTP (Roche Diagnostics Corp., 
Indianapolis, IN, USA)), and dinitrophenyl-1 1-dUTP (Molecular Probes, Inc., Eugene, OR, 
USA). 
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Biotin may be captured using avidin, streptavidin, captavidin, neutravidin, or 
anti-biotin antibodies. Digoxigenin and dinitrophenyl can be captured using antibodies 
specific for the respective hapten. 

Capture to a solid support, after formation of deproteinization-stable single 
5 or double D-loops at the target query region, permits separation of the D-loop-containing 
nucleic acids from nucleic acids in the sample that do not stably bind the oligonucleotide 
having the capture moiety. 

Alternatively, either the first or the second oligonucleotide may be tethered, 
directly or indirectly, to a solid support at the time of single or double D-loop formation: 
o separating the solid support from the sample after forming deproteinization-stable single or 
double D-loops at the target query region separates the D-loop-containing nucleic acids 
from nucleic acids in the sample that do not bind the tethering oligonucleotide. 

The solid support, in either the capture moiety or tethered embodiments, 
may include one or more surfaces of a unitary object, such as a slide or microliter plate, or 
5 may instead include surfaces of a plurality of discrete objects, such as beads. 

The solid support may be glass, although other solid materials, such as 
amorphous silicon, crystalline silicon, or plastics, may also be used. Such plastics include 
polymethylacrylic, polyethylene, polypropylene, polyacrylate, polymethylmethacrylate, 
polyvinylchloride, polytetrafluoroethylene, polystyrene, polycarbonate, polyacetal, 
o polysulfone, celluloseacetate, cellulosenitrate, nitrocellulose, and mixtures thereof. 

The solid support may be a surface of a bead, or pellet. The beads need 
not be spherical. In addition, the beads may be porous, thus increasing the available 
surface area of the bead available for assay. Bead sizes usefully range from nanometers, 
e.g. 100 nm, to millimeters, e.g. 5 mm, usefully from about 0.2 micron to about 200 microns, 
with beads from about 0.5 to about 5 microns being typical. 

Suitable bead compositions include those used in peptide, nucleic acid and 
organic moiety synthesis, and include, for example, controlled pore glass, plastics, such as 
polystyrene, methylstyrene, acrylic polymers, ceramics, glass, paramagnetic materials, 
titanium dioxide, latex, cross-linked dextrans, cellulose, and nylon. See, e.g., "Microsphere 
Detection Guide" (Bangs Laboratories, Inc., 
http://www.bangslabs.com/products/bangs/guide.php). 
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Usefully, the beads are magnetic, paramagnetic, or superparamagnetic, 
permitting separation of the beads from the liquid sample by application of a suitable 
magnetic field. A variety of such beads may be purchased commercially from Dynal® 
Biotech Inc. (Lake Success, NY USA) and Miltenyi Biotec Inc. (Auburn, CA USA). 

In tethered embodiments, the first or second oligonucleotide maybe 
tethered to the solid support by direct covalent linkage, by direct coordinate bonding (for 
example, as between a thiolated oligonucleotide and a gold or platinum surface), or 
indirectly using any of a variety of bonds. In a useful series of embodiments, the first or 
second oligonucleotide is tethered indirectly to a solid support by a strong noncovalent 
interaction between specific binding partners. 

For example, a biotinylated first or second oligonucleotide may be tethered 
(or, in capture embodiments, captured subsequent to formation of deproteinization-stabie 
single or double D-loops) to a solid support having streptavidin on its surface. Streptavidin- 
coated magnetic beads useful for this purpose are available commercially from a variety of 
vendors: Dynabeads M-280 Streptavidin and Dynabeads M-270 Streptavidin from Dynal® 
Biotech Inc. (Lake Success, NY USA); 1 M m supeiparamagnetic beads covalently coupled 
to a highly pure form of streptavidin from New England Biolabs (Beverley, MA, USA); 
magnetite- and polymer-coated 1 micron beads from Active Motif (Carlsbad, CA, USA); 
streptavidin microbeads from Miltenyi Biotec, Inc. (Auburn, CA, USA). Microbeads having 
anti-biotin antibodies are also useful and are available commercially (Miltenyi Biotec, Inc., 
Auburn, CA, USA). 

A further step of removing the first and second oligonucleotides from the 
target may be performed. This may usefully be accomplished, for example, by subjecting 
the sample to conditions, typically after deproteinization, under which the single or double 
D-loop, but not the target duplex as a whole, is unstable. 

The separation step may be iterated as needed to provide the desired 
degree of separation of target from target variants. 

The separation step can effect a purification of the target - measured as 
the fold-increase in molar ratio of target to a variant concurrently present within the sample 
that differs from the target within a common query region by as few as one nucleotide - by 
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2-fold, 3-fold, 4-fold, 5-fold, by as much as 10-fold, 50-fold, 100-fold, even by as much as 
10 3 -fold, lOMold, 10 5 -fold and even as much as 10 6 -fold or more. 

In various of the embodiments described above, target is separated from 
target variants before, after, or coincident with distinguishing the presence of target in the 
5 nucleic acid sample. 

Detection of target is not required for separation, however. 

Thus, in another aspect, the invention provides a method of separating a 
nonsupercoiled double-stranded nucleic acid target from other nonsupercoiled nucleic acids 
present within a sample of nucleic acids, with selectivity sufficient to separate the target 

1 o from variants that differ from said target by as few as one nucleotide within a common target 

query region. 

The method comprises using a recombinase to mediate formation of at 
least one deproteinization-stable double D loop in the query region of the target, under 
conditions that favor double D loop formation at the target query region over formation at 
1 5 variants that differ from the target by as few as one nucleotide thereof; and then separating 
nucleic acids having deproteinization-stable double D loops from other nucleic acids present 
within said sample. 

Formation conditions are as described above, which description is 
incorporated here by reference in its entirety. 

20 In another aspect, the invention provides a method of separating a 

supercoiled double-stranded nucleic acid target from other supercoiled nucleic acids 
present within a sample of nucleic acids, with selectivity sufficient to separate the target 
from variants that differ from said target by as few as one nucleotide within a common target 
query region. 

2 5 The method comprises using a recombinase to mediate formation of at 

least one deproteinization-stable single D-loop or double D-loop in the query region of the 
target, under conditions that favor single D-loop or double D-loop formation at the target 
query region over formation at variants that differ from the target by as few as one 
nucleotide thereof; and then separating nucleic acids having deproteinization-stable double 
30 D loops from other nucleic acids present within said sample. 
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D 



Formation conditions are as described above, which description is 
incorporated here by reference in its entirety. 

The methods of the present invention may further include the step, after 
single- or double D-loop formation, of extending by polymerase either or both of the first or 
second oligonucleotides. 

In one series of such embodiments, the first or second oligonucleotide is 
extended by a single base. 

Single base extension (SBE) reactions are commonly used in the art for 
querying polymorphic sites. In SBE reactions, target specificity is achieved by hybridizing a 
primer of sufficient length to discriminate among targets; selectivity among target variants is 
achieved by using the sequence-specificity of a subsequent polymerase reaction to identify 
a polymorphic nucleotide situated adjacent to the priming site on the template. The 
extension reaction is limited to a single base in a number of ways, typically by including one 
or more species of chain-terminating nucleotide. For review, see Kirk et a/., Nucl. Acids 
Res. 30:3295-331 1 (2002); Syvanen, Nature Reviews, Genetics 2:930-942 (2001); Kwok, 
Annu. Rev. Genomics Hum. Genet. 2:235-258 (2001). 

Although the methods of the present invention permit target specificity and 
variant selectivity to be achieved in a single step - by formation of deproteinization-stable 
single D-loops or double D-loops selectively at specific targets - an additional SBE reaction 
can provide additional selectivity, additional genotyping information, or facilitate selective 
isolation. In contrast to standard SBE reactions in the art, the target template is double- 
stranded, providing significant advantages in sample preparation and in separation and 
isolation of variants in double-stranded form. 

For example, the single base extension may be performed in the presence 
of one or more distinguishably labeled chain terminating nucleotides. 

In embodiments in which neither first nor second oligonucleotide is 
detectably labeled, label is incorporated, and signal detectable, only if the position queried 
by SBE has one of the desired sequences. In embodiments in which the first and/or second 
oligonucleotide is labeled, further label is incorporated only if the position queried by SBE 
one or more of the desired sequences. 
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The chain-terminating nucleotide may include a capture moiety. 

In embodiments in which neither first nor second oligonucleotide includes a 
capture moiety, the capture moiety is incorporated, and the target separable from the 
sample, only if the position interrogated by SBE one of the desired sequences. In 
5 embodiments in which the first and/or second oligonucleotide includes a capture moiety, an 
additional capture moiety is incorporated only if the position interrogated by SBE has one of 
the desired sequences. If the capture moiety incorporated by single base extension is a 
different molecular species from that incorporated into first or second oligonucleotide, the 
polymerase-incorporated moiety can be used for a first round of separation, with the moiety 

1 o earlier-incorporated into first or second oligonucleotide used for subsequent rounds of 

purification, or wee versa. 

In another series of extension reactions, the first and/or second 
oligonucleotides are extended more than a single nucleotide. A variety of these 
embodiments permit amplification of the target query region, or portions thereof; with 
1 5 formation of single D-loops or double D-loops at targets with selectivity sufficient to 
distinguish target from variants that differ by as few as one nucleotide therefrom, these 
embodiments permit the selective amplification, from inhomogeneous nucleic acid samples, 
of desired variants, such as desired allelic variants. 

Amplification may be unidirectional or bidirectional, isothermal or thermal 

2 o cycling, in any combination. 

For example, either or both of the first or second oligonucleotide may 
include a phage RNA polymerase promoter, such as an SP6, T3, or T7 promoter, 5' to its 
respective complementarity region. Transcription from the phage promoter after dDloop 
formation provides RNA transcripts that include at least a portion of the target query region. 

2 5 Transcription in the presence of one or more labeled nucleotides permits this RNA 

amplification product to be detected, for example by contact to a nucleic acid microarray, 
providing a convenient readout for the genotyping assay. 

Isothermal bidirectional amplification may also be performed, essentially as 
described in U.S. Patent No. 5,223,414, the disclosure of which is incorporated herein by 

3 o reference in its entirety. Briefly, at least two D-loops (single or double, depending on the 

topology of the target) are formed on the target nucleic acid: the first D-loop, as described 
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above, is formed at the query region with selectivity sufficient to distinguish the target from 
variants that differ in the common query region by as few as one nucleotide therefrom; the 
second D-loop is formed either 5' or 3' thereto. 

The two sets of oligonucleotides are hybridized to the duplex target 
sequences in the presence of ATPyS and a reaction mixture comprising dNTPs, RecA 
protein and a DNA polymerase. The reaction is performed below the temperature required 
for thermal dissociation of the two target strands and continued until a desired degree of 
amplification of the target sequence is achieved. The reaction may further include repeated 
additions of (i) DNA polymerase and (ii) RecA protein-coated probes during the course of 
the amplification reactions. Other approaches to amplification are described in United 
States Patent No. 5,223,414, incorporated herein by reference in its entirety. 

In each set of first and second oligonucleotides, the 3' end of one 
oligonucleotide will be internal to the region defined by the two sets of oligonucleotides; free 
hydroxyls at these ends are necessary for the amplification reaction. However, in some 
embodiments of this amplification method, the 3' ends that are external to the region defined 
by the two primer sets may optionally be blocked to inhibit the formation of extension 
products from these ends. 

This amplification method can also be used as a detection method or 
capture method, where detection or capture is accomplished by polymerase-facilitated 
primer extension from the 3'-ends of each oligonucleotide strand in the presence of 
dNTP(s), where one or more dNTP contains a detectable or capture moiety. 

The methods of the present invention may also include a further step of 
contacting nucleic acids of the sample with an enzyme capable of cleaving DNA. Typically, 
the enzyme is capable of cleaving a site within or adjacent to the target query region. 

In one series of embodiments, the single or double D-loop formed 
selectively at the target query region directly blocks cleavage of a restriction site present 
therein. In these embodiments, D-loops are formed at the target query region with 
selectivity sufficient to distinguish the target query region from variants that differ therefrom 
by as few as one nucleotide, according to the methods of the present invention. Before the 
sample is deproteinized, a cleaving enzyme, such as a restriction endonuclease, is added. 
Cleavage sites, such as restriction sites, that are within the target query region are 
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complexed within the protein-containing D-loop and are protected from cleavage. Cleavage 
sites present within the query region of variants that differ from the target by as few as one 
nucleotide are typically not complexed within protein-containing D-loops, and are thus 
cleaved. 

5 In other embodiments, formation of the single or double D-loop selectively 

at the target query region is exploited selectively to alter the methylation state of the target 
or of the target variants; contact thereafter with a cleaving enzyme, such as a restriction 
endonuclease, that is sensitive to the methylation state of the duplex effects selective 
cleavage of target or variants, as desired. 

10 In one such approach, for example, the first and/or second oligonucleotide, 

is methylated. If the first oligonucleotide is methylated, it is methylated at positions that do 
not substantially diminish RecA binding and nucleofilament formation. 

If the target is unmethylated, the deproteinization-stable single or double D- 
loop formed at the target will include at least one hemimethylated duplex region. Contact 

15 with a restriction endonuclease that is unable to cleave hemimethylated DNA results in 
selective cleavage of target variants that differ from target by as few as one nucleotide in 
the query region common therebetween; the target itself is not cleaved. Conversely, 
contact with a restriction endonuclease that requires hemimethylated DNA results in 
selective cleavage of the target; variants that differ from target by as few as one nucleotide 

20 in the query region common therebetween are not cleaved. 

Many restriction enzymes that are sensitive to the presence of methyl 
groups in various target sequences are known and can be used in such methods, including, 
for example, Aatl, Aatll, Accl, Accll, Acclll, Acc65l, AccB7l, Acil, Acll, Adel, Afal, Afel, Afll, 
Aflll, Afllll, Agel, Ahall, Ahdl, Alol, Alul, Alwl, Alw21l, Alw26l, Alw44l, AlwNI, Amal, Aorl, 

25 Aor51HI, Aosll, Apal, ApaLI, Apel, Apol, Apyl, Aqul, AscI, Asp700l, Asp718l, AspCNI, 
AspMI, AspMDI, AtuCI, Aval, Avail, Avill, Bael, Ball, BamFI, BamHI, BamKI, Banl, Banll, 
Bazl, Bbel, Bbill, BbrPI, Bbsl, Bbul, Bbvl, BbvCI, Bca77l, Bce243l, BceAl, Bcgl, BciVI, Bell, 
Bcnl, Bepl, Bfil, Bfi57l, Bfi89l, Bfrl, Bful, Bgll, Bglll, Binl, BloHl, Blpl, BmaDI, Bme216l, 
Bme1390l, Bme1580l, BmeTI, BmgBI, Bnal, Boxl, Bpil, Bpll, Bpml, Bpul, BpulOI, 

3 o Bpu1 1021, Bsal, Bsa29l, BsaAl, BsaBI, BsaHl, BsaJI, BsaWI, BsaXI, Bscl, BscFI, BseCI, 
BseDI, BseGI, BseLI, BseMI, BseMII, BseRI, BseSI, BseXI, Bsgl, Bsh1236l, Bsh1285l, 
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10 



15 



20 



25 



30 



Bsh1365l, BshGI, BshNI, BshTI, BsiBI, BsiEl, BsiHKAl, BsiLI, BsiMI, BsiQI, BsiWI, BsiXI, 
Bsll, Bsml, BsmAI, BsmBI, BsmFI, BsoBI, BsoFI, Bsp49l, Bsp51l, Bsp52l, Bsp54l, Bsp56l, 
Bsp57l, Bsp58l, Bsp59l, Bsp60l, Bsp61l, Bsp64l, Bsp65l, Bsp66l, Bsp67l, Bsp68l, Bsp72i, 
Bsp91l, Bsp105l, Bsp106l, Bsp119l, Bsp120l, Bsp122l, Bsp143l, Bsp143ll, Bsp1286l, 
Bsp2095l, BspAl, BspDI, N.BspDSI, BspEI, BspFI, BspHI, BspJ64l, BspKTBI, BspLI, 
BspLUHIH, BspMI, BspMII, BspPI, BspRI, BspSTSI, BspT104l, BspT107l, BspXI, BspXIl, 
BspZEl, BsrBI, BsrBRI, BsrDI, BsrFI, BsrPII, BssHII, BssKI, BssSI, Bstl, Bst1107l. BstAPI, 
BstBI, BstEII, BstEIM, BstENII, BstF5l, BstGI, BstKTI, BstNl, BstOI, BstPI, BstSCI.BstUI, ' 
Bst2UI, BstVI, BstXI, BstYl, BstZ17l, Bsu15l, Bsu36l, BsuBI, BsuEII, BsuFI, BsuMI, BsuRI, 
BsuTUI, Btcl, Btgl, Btrl, Btsl, Cad, CacSI, Cail, Caull, Cbil, Cbol, Cbrl, Ccel, Ccrl, Ccyl, 
Cfol, Ctrl, CfrfSI Cfr9l, CfrlOl, Cfr13l, Cfr42l, CfrBI, Cful, Clal, Cpel, Cpfl, CpfAl, Cpol, Cspl, 
Csp5l, CspSI, Csp45l, Csp68KII, Cthll, Ctyl, CviAl, CviAII, CviBI, M.CviBIII, CviJI, N.CviPII,' 
CviQI, N.CviQXI, CviRI, CviRII, CviSIII, CviTI, Ddel, Dpnl, Dpnll, Oral, Drali, Dralll, DitJI, ' 
DsaV, Eael, Eagl, EamllMI, Eam1105l, Earl, Ecal, Ecil, Ecl136ll, EclXI, Ecl18kl, Eco24l, 
Eco31l, Eco32l, Eco47l, Eco47lll, Eco52l, Eco57l, Eco72l, Eco88l, Eco91l, Eco105l, 
Eco147l, Eco1831l, EcoAl, EcoBI, EcoDI, EcoHl, EcoHK31l, EcoKI, Eco065l, EcoC-1091, 
EcoPI, EcoP15l, EcoRI, EcoRII, M.EcoRII, EcoRV, EcoR124l, EcoR124ll, EcoT22l, Ehel! 
EsaBC3l, EsaBC4l, EsaLHCI, Esp3l, Esp1396l, Faul, Fbal, FnuDII, FnuEl, Fnu4HI,' Fokl,' 
Fsel, Fspl, Fsp4HI, Gsul, Haell, Haelll, HaelV, Hapll, Hgal, HgiAl, HgiCI, HgiCII, HgiDI, ' 
HgiEl, HgiHI, Hhal, Hhali, Hinll, HinSI, HinPII, Hindi, Hindi), Hindlll, Hinfl, Hpal, Hpall,' 
Hphl, HpySI, Hpy99l, Hpy99ll, Hpy188l, Hpy188lll, HpyAIII, HpyAIV, HpyCH4lll, HpyCH4IV, 
Hsol, Ital, Kasl, Kpnl, Kpn2l, Kspl, Ksp22l, KspAl, Kzo9l, LlaAl, LlaKR2l, Mabl, Maell, 
Maml, Mbil, Mbol, Mboll, Mfll, Mlsl, Mlul, Mlu9273l, Mlu9273ll, Mlyl, Mmel, Mmell, Mnll, 
Mral, Mscl, Msel, MsII, Mspl, M.Mspl, MspAII, MspBI, MspR9l, Mssl, Mstll, MthTI, MthZI, 
Muni, Mval, Mva1269l, Mvnl, Mwol, Nael, Nanll, Narl, Neil, NciAl, Ncol, Ncul, Ndel, Ndel'l, 
NgoBV, NgoBVIII, NgoCI, NgoCII, NgoFVII, NgoMIV, NgoPII, NgoSII, NgoWl, Nhel," Nlalll" 
NlalV, NlaX, NmeSI, NmuCI, NmuDI, NmuEl, Notl, Nrul, Nsbl, Nsil, Nspl, NspV, NspBII, ' 
NspHI, Pad, Pael, PaeR7l, Pagl, Paul, Pdil, Pdml, Pei9403l, Pfal, PA23II, PflFI, PflMI, Plel, 
Ple19l, PmaCI, Pmel, Pmll, Ppil, PpuMI, PshAI, Psp5ll, Psp1406l, PspGI, PspOMI, PspPI, ' 
PstI, Psul, Psyl, Pvul, Pvull, RalSI, RalF40l, RflFI, RflFII, Rrh4273l, Rsal, Rshl, RspXI, Rsrl, 
RsrII, Sad, Sacll, Sail, SalDI, Sapl, Sau96l, Sau3239l, Sau3AI, SauLPI, SauMI, Sbo13l, 
Seal, Scg2l, Schl, ScrFI, Sdal, Sdul, SenPI, SexAl, SfaNI, Sfil, Sfol, Sful, Sgfl, SgrAI, Sinl, 
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Smal, Smll, SnaBI, Snol, Soil, Spel, Sphl, Spll, Spol, Srfl, Sru30DI, SscLII, Sse9l, 
Sse8387l, Ssol, Ssoll, SspRFI, Sstl, Sstll, Sth368l, Stsl, Stul, StyD4l, StyLTI, StyLTIII, 
StySJI, StySPI, StySQI, Taal, Tail, Taql, Taqll, TaqXI, Tfil, TBI, Thai, TBI, Tsel, Tsp45l, 
Tsp509l, TspRI, Tth111l, TthHB8l, Van91l, VpaKHBI, Vspl, Xapl, Xbal, Xcel, Xcml, Xcyl, 
5 Xhol, Xholl, Xmal, Xmalll, Xmil, Xmnl, Xorll, Xspl, and Zanl. Similarly, many enzymes that 
can be used to methylate the duplex target nucleic acid molecule (methylases) are known, 
including, for example, M.AacDam, M.AaoHemKP, M.AarAIP. M.Aatll, M.Abrl, M.Accl, 
M.Acclll, M.Acil, M.Acll, M.Aflll, M.Afllll, M.AfuHemKP, M.AfuORF1409P, M.AfuORF1715P, 
M.AfuORF2345P, M.Agel, M.Ahdl, M.AimAI, M.AimAII, M.Alol, M.AIul, M.AIwl, M.AIw26l, 

l o M.Apal, M.ApaLI, M.ApeKHemKP, M. ApeKORF73P, M.ApeKORF446P, M.ApeKORF554P, 
M.ApeKORF872P, M.ApeKORF1835P, M.ApeKORF2002P, M.Apol, M.Aqul, MAscI, 
M.Asel, M.AsiSI, M.AspCNI, M.AthBP, M.Athl, M.Athlll, M.AthlVP, MAthVP, M.AthVIP, 
M.AthVIIP, M.AthVIIIP, M.AtuCHemKP, M.AtuCHemK2P, M.AtuCHemK3P, M.AtuCORF8P, 
M.AtuCORF1453P, M.AtuCORF1997P, M.Aval, M.Avall, M.Avalll, M.AvalVP, M.AvaV, 

15 M.AvaVI, MAvaVII, M.AvaVIH, M.AvalX, M.Avrll, M.Babl, M.Bali, M.BamHI, M.BamHIl, 
M.Banl, M.Banll, M.Banlll, M.BbuB31HemKP, M.Bbulp250RF2P, M.Bbulp250RF29P, 
M.Bbulp560RF67P, M.Bbvl, M.BbvCIA, M.BbvCIB, M.Bce10987IP, M.BceAIA, M.BceAlB, 
M.Bcgl, M.Bchl, M.Bcll, M.BcnIA, M.BcnIB, M.BepI, M.BfalA, M.BfalB, BfaHemKP, 
M.BfaORFC113P, M.BfaORFC143P, M.BfaORFC157P, M.BfaORFC196P, 

2 0 M.BfaORFC198P, M.BfaORFC205P, M.BfaORFC223P, M.BfaORFC240P, M.BfilA, M.BfilB, 
M.BfuAIA, M.BfuAlB, M.Bgll, M.Bglll, M.BhaHemKP, M.BhaORF3508P, M.BhaORF3535P, 
M.BhaORF4003AP, M.BhaORF4003BP, M.BIpl, M.BmrIA, M.BmrIB, M.BollP, M.BolllP, 
M.BpmlA, M.BpmlB, M.BpulOIA, M.BpulOIB, M.BsalA, M.BsalB, M.BsaAl, M.BsaJI, 
M.BsaWI, M.BscGIA, M.BscGIB, M.Bse634l, M.BseCI, M.BseDI, M.BseMII, M.BseRIA, 

2 5 M.BseRIB, M.BseYI, M.BsgIA, M.BsgIB, M.Bsll, M.BsmlA, M.BsmlB, M.BsmAI, M.BsmBI, 
M.BsoBI, M.Bsp6l, M.Bsp98l, M.BspCNIA, M.BspCNIB, M.BspHI, M.BspLUHIIIA, 
M.BspLUHIIIB, M.BspLUHIIIC, M.BspMIA, M.BspMIB, M.BspRI, M.BsrIA, M.BsrIB, 
M.BsrBIA, M.BsrBIB, M.BsrDIA, M.BsrDIB, M.BsrFI, M.BssHI, M.BssHII, M.BssSI, 
M.BstF5l, M.BstLVI, M.BstNBI, M.BstNBII, M.BstVI, M.BstYl, M.Bsu36l, M.Bsu168IP, 

30 M.Bsu168IIP, M.Bsu168IIIP, M.BsuBI, M.BsuFI, M.BsuRI, M.BusHemKP, M.BusHemK2P, 
M.Cac824l GCNGC, M.Cac824HemKP, M.Cac8240RF1222P, M.Cac824ORF2309P, 
M.Cac8240RF3358P, M.Cac8240RF3534AP, M.Cac8240RF3534BP, M.CauJHemKP, 
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M.CauJORFC101P, M.CauJORFC102P, M.CauJORFC103P, M.CauJORFC104P, 
M.CauJORFC107P, M.CauJORFCHOP, M.CauJORFC111P, M.CauJORFC112P, 
M.CauJORFC113P, M.CauJORFC114P, M.CauJORFC116P, M.CauJORFC117P, 
M.CauJORFC119P, M.CcrMI GANTC, M.CcrMHemKP, M.CcrMHemK2P, M.CcrMHemK3P, 
M.CcrMORF620P, M.CcrMORF1033P, M.CcrMORF3626P, M.Cdi630HemKP, 
M.Cdi630ORFC636P, M.Cdi630ORFC861P, M.Cdi630ORFC898P, M.Cdi630ORFC633aP, 
M.Cdi630ORFC633bP, M.CelHemKP, M.Ceql, M.CfrSI, M.CfrlOl, M.CfrBI, M.Cgll, 
M.ChuAHemKP, M.ChuAORFC123P, M.ChuAORFC127P, M.Cjel, M.CjeNHemKP, 
M.CjeN0RF31P, M.CjeNORF208P, M.CjeNORF690P, M.CjeNORF1051P, 
M.CjeNORF1553P, M.CmuHemKP, M.CpalOWAlP, M.CpnAHemKP, M.CpnHemKP, 
M.CpnJHemKP, M.CsyAlP, M.CsyBIP, M.CtrHemKP, M.CviAl, M.CviAII, M.CviAIIIP, 
M.CviAIV, M.CviAV, M.CviBI, M.CviBIII, M.CviJI, M.CviPl, M.CviQI, M.CviQIII, M.CviQVP, 
M.CviQVI, M.CviQVII, M.CviRI, M.CviSI, M.CviSII, M.CviSIII, M.CviSVlP, M.Dcal, M.Dcall, 
M.Ddel, M.DhaHemKP, M.DhaORFC135P, M.DhaORFC140P, M.DhaORFC141P, 
M.DhaORFC512P, M.DmeORFAP, M.DmeORFBP, M.DnolP, M.DpnllA, M.DpnIIB, 
M.Dralll, M.DraRHemKP, M.DraRORFB138P, M.DraRORFC20P, M.DrelP, M.DsaV, 
M£ael, M.Eagl, M.EarIA, M.EarIB, M.Ecal, M.Ecl18kl, M.Eco31l, M.Eco47l, M.Eco47ll, 
M.Eco56l, M.Eco57IA, M.Eco57IB, M.Eco72l, M.EcoAl, M.EcoBI, M.Eco67Dam, M.EcoEl, 
M.EcoHK31l, M.EcoKI, M.EcoKIIP, M.EcoK12AhemKP, M.EcoKDam, M.EcoKDcm, 
M.EcoKHemKP, M.EcoK0157DamP, M.EcoK0157DcmP, M.EcoK0157HemKP, 
M.EcoK0157HemK2P, M.EcoK0157HemK3P, M.EcoK01570RF1196P, 
M.EcoKO157ORF1780P, M.EcoK01570RF2981P i M.EcoK01570RF4134P, 
M.EcoKO157ORF5307P, M.EcoNI, M.EcoN150RF52P, M.EcoN150RF58P, M.EcoO109i, 
M.Eco0157IP, M.Eco0157DamP, M.Eco0157DcmP, M.Eco0157HemKP, 
M.Eco0157HemK2P, M.Eco0157HemK3P, M.Eco0157HemK4P, M.Eco01570RF1454P, 
M.EcoO157ORF2060P, M.Eco01570RF2389P, M.Eco01570RF3349P, 
M.EcoO1570RF4622P, M.Eco01570RF5947P, M.EcoPI, M.EcoP15l, M.EcoPIDam, 
M.EcoRI, M.EcoRII, M.EcoRV, M.EcoR9l, M.EcoR124l, M.EcoR124ll, M.EcoTIDam, 
M.EcoT2Dam, M.EcoT4Dam, M.EcoVIII, M.EcoVT2Dam, M.Eco933WdamP, M.Eco29kl, 
M.Ecoprrl, M.EfaAHemKP, M.EfaAORFC149P, M.EfaAORFC151P, M.EfaAORFC154P, 
M.EfaORFAP, M.EfaORFC154P, M.EniiP, M.EsaBCII, M.EsaBC2l, M.EsaBC3l, 
M.EsaBC4l, M:EsaBS1l, M.EsaBS2l, M.EsaDixll, M.EsaDix2l, M.EsaDix3l, M.EsaDix4l, 
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M.EsaDix5l, M.EsaDix6l, M.EsaDix7l, M.EsaLHCI, M.EsaLHCII, M.EsaLHCIII, 
M.EsaLHC2l, M.Esp3l, M.FacHemKP, M.FacHemK2P, M.FacORFC156P, 
M.FacORFC157AP, M.FacORFC157BP, M.FacORFC158P, M.FacORFC160P, M.Faul, 
M.FnuDI, M.Fnu4HI, M.Fokl, M.Fsel, M.Fspl, M.Fsp7605IP, M.Fvi3l, M.Ggal, M.GshIP, 
5 M.Gsul, M.H2I, M.Haell, M.Haelll, M.HaelV, M.HgalA, M.HgalB, M.HgiBI, M.HgiCI, 
M.HgiCII, M.HgiDI, M.HgiDII, M.HgiEl, M.HgiGI, M.Hhal, M.Hhall, M.HinHPIDam, 
M.HinPII, M.Hincll, M.Hindi, M.Hindll, M.Hindlll, M.HindIV, M.HindV, M.HindHemKP, 
M.HindHemK2P, M.HindORF1056P, M.HindORF1286P, M.Hinfl, M.Hpal, M.Hpall, 
M.HphIA, M.HphIB, M.Hpyl, M.Hpy8l, M.Hpy99l, M.Hpy99ll, M.Hpy99lll, M.Hpy99IV, 

io M.Hpy99VA, M.Hpy99VBP, M.Hpy99VI, M.Hpy99VII, M.Hpy99VIII, M.Hpy99IX, M.Hpy99X, 
M.Hpy99XI, M.Hpyl 66DP, M.Hpy166EP, M.Hpy166FP, M.Hpy166IVP, M.Hpy178IP, 
M.Hpy188l, M.Hpy188ll, M.Hpy188lll, M.HpyAI, M.HpyAIIA, M.HpyAIIB, M.HpyAIII, 
M.HpyAIV, M.HpyAV, M.HpyAVIA, M.HpyAVIB, M.HpyAVII, M.HpyAVIII, M.HpyAlX, 
M.HpyAX, M.HpyAXI, M.HpyAHemKP, M.HpyAORF263P, M.HpyAORF369P, 

15 M.HpyAORF463P, M.HpyAORF481P ( M.HpyAORF483P, M.HpyAORF850P, 

M.HpyAORF1354P, M.HpyAORF1370P, M.HpyAORF1403P, M.HpyAORF1472P, 
M.HpyAORF1517P, M.HpyAORF1522P, M.HpyCH4IV, M.HpyCH4V, M.Hpy166GP, 
M.Hpy166HP, M.Hpy99HemKP, M.Hpy990RF415P, M.Hpy99ORF430P, 
M.Hpy990RF433P, M.Hpy990RF613P, M.Hpy990RF786P, M.Hpy990RF846P, 

20 M.Hpy99ORF1012P, M.Hpy990RF1284P, M.Hpy990RF1296P, M.Hpy990RF1365P, 
M.Hpy99ORF1409P, M.Hpy990RF1411P, M.Hpy990RF1423P, M.HsalB, M.HsallP, 
M.HsalllA, M.HsalllB. M.HsalVP, M.HsaHemKP, M.HsaHemK2P, M.HspNIP, 
M.HspNHemKP, M.HspNORF106P, M.HspNORF1543P, M.HspNORF2242P, 
M.HspNORF6135AP, M.HspNORF6135BP, M.Kpnl, M.Kpn2l, M.KpnAI, M.LdvlP, M.LesIP, 

25 MlinHemKIP, M.LIal, M.LIa82l, M!la1403l, M.LIa2009IP, M.LIa2614l, M.LIaAIA, MilaAlB, 
M.LIaBI, M.UaBIIP, M.LIaBIII, M.LIaCI, M.LIaDII, M.LIaDCHIA, M.LIaDCHIB, M.UaFI, 
MllaGI, M.LIa1403HemKP, M.UaKR2l, M.LIa509ORFAP, M.LIaPI, M.LIdl, M.LmoA118l, 
M.Lsp1109l, M.Maml, M.MarMIP, M.MbaHemKP, M.MbaORFC198P, M.MbaORFC203P, 
M.MbaORFC206P, M.MbaORFC207P, M.MbaORFC531P, M.MbaORFC533P, M.MbolA, 

3 o M.MbolB, M.MbollA, M.MbollB, M.MboAHemKP, M.MboAORFC210P, M.MboAORFC263P, 
M.MboAORFC271P, M.Mca27343l. M.Mfel, M.MgeHemKP, M.MgeORF184P, M.MgrlP, 
M.Mjal, M.Mjall, M.Mjalll, M.MjalVP, M.MjaV, M.MjaVI, M.MjaHemKP. M.MjaORF132P, 
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M.MjaORF563P, M.MjaORF1200P, M.MjaORF1220P, M.M]aORFCL42P, M.MIeHemKAP, 
M.MIeHemKBP, M.MIeSHemKP, M.MIeSORF756P, M.MIoHemKP, M.MIoORFmll9056P, 
M.MIoORFmll9333P, M.MIoORFmlr7520P, M.MIoORFmlr7992P, M.MIoORFmlr8517P, 
M.MIul, M.MIyl, M.MmaMHemKP, M.MmaMHemK2P, M.MmaMHemK3P, 
M.MmaMORF527P, M.MmaMORFC170P, M.MmaMORFC174P, M.MmaMORFC175AP, 
M.MmaMORFCI 75BP, M.MmaMORFC525P, M.MmaMORFC527P, M.Mmul, M.MmullP, 
M.MmulllA, M.MmulllB, M.MmylP, M.MneAORF1590P, M.MnllA, M.MnllB, M.MpnIP, 
M.MpnHemKP, M.MpnQRFDP, M.MpuCHemKP, M.MpuCORF430AP, M.MpuCORF430BP, 
M.MpuCORF810AP, M.MpuCORF810BP, M.MpuCORF1850AP, M.MpuCORF1850BP, 
M.MpuCORF3960P, M.MpuCORF3970P, M.MpuCORF3980P, M.MpuCORF4330P, 
M.MpuCORF4800P, M.MpuCORF6780P, M.MpuCORF6880P, M.MpuUI, M.MsaRVIP, 
M.MsaV2IP, M.MsaV3IP, M.MsaV4IP, M.Msel, M.Mspl, M.MspAII, M.MspMCHemKP, 
M.MspMCHemK2P, M.MspMCHemK3P, M.MspMCORFC183P, M.MspMCORFC184P, 
M.MspMCORFC186P, M.MspMCORFC187AP, M.MspMCORFC187BP, M.MthHHemKP, 
M.MthHORF495P, M.MthHORF724P, M.MthHORF942P, M.MthTI, M.MthZI, 
M.MtuCTHemKP, M.MtuCTORF2076P, M.MtuCTORF2082P, M.MtuCTORF2826P, 
M.MtuCTORF3363P, M.MtuHHemKP, M.MtuHORF2756P, M.MtuHORF3263P, M.Munl, 
M.Mval, M.Mwol, M.Nael, M.NarAORFC306P, M.Ncol, M.NcrNI, M.Ndel, M.NeuHemKP, 
M.NeuORFC215AP, M.NeuORFC215BP, M.NeuORFC218P, M.NeuORFC219P, M.NgoBI, 
M.NgoBIIP, M.NgoBV, M.NgoBVIIIA, M.NgoBVIIIB, M.NgoFVII, M.NgoLII. M.NgoLHemKP, 
M.NgoMIV, M.NgoMX, M.NgoMXV, M.NgoMorf2P, M.NgoPII, M.NgoSII, M.Ngo125VIIP, 
M.Nhel.M.NIalll, M.NIalV, M.NlaX, M.NrneAHemKP, M.NmeAHemK2P, M.NmeAORF59P, 
M.NmeAORF191P, M.NmeAORF427P, M.NmeAORF532P, M.NmeAORF561P, 
M.NmeAORF1035P, M.NmeAORF1038P, M.NmeAORF1385P, M.NmeAORF1432P, 
M.NmeAORF1453P, M.NmeAORF1467P, M.NmeAORF1500P, M.NmeAORF1590P, 
M.NmeBIA, M.NmeBIB, M.NmeBF13P, M.NmeBHemKP, M.NmeBHemK2P, 
M.NmeBORF76P, M.NmeBORF826P, M.NmeBORF829P, NtneBORF1033P, 
M.NmeBORF1223P, M.NmeBORF1261P, M.NmeBORF1290P, M.NmeBORF1375P, 
M.NmeB1940ORF1P, M.NmeDIP, M.Nme2120ORF1P, M.NmeSI, M.NmeST11170RF1P, 
M.Notl, M.NpuHemKP, M.NpuORFC221P, M.NpuORFC222P, M.NpuORFC224P, 
M.NpuORFC226P, M.NpuORFC227P. M.NpuORFC228P, M.NpuORFC229P, 
M.NpuORFC230P, M.NpuORFC231P, M.NpuORFC232P, M.NpuORFC234P, 
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M.NpuORFC237P, M.NpuORFC242P, M.Nspl, M.Nsplll, M.NspV, M.NspHI, M.OkrAI, 
M.OsalP, M.PabHemKP, M.PabORF588P, M.PabORF1205P, M.PabORF1283P, 
M.PabORF2149P, M.PabORF2246P, M.PabORF2317P, M.Pac25l, M.PaePAHemKP, 
M.PaePAHemK2P, M.PaePAHemK3P, M.PaePAHemK4P, M.PaePAORF370P, 
5 M.PaePAORF2735P, M.PaeR7l, M.PcopB4P, M.PflMI, M.PflPHemKP, M.PflPHemK2P, 
M.Pf!PHemK3P, M.PflPHemK4P, M.PfuAlP, M.Pgil, M.PhaAl, M.PhaBI, M.PhiBssHII, 
M.PhiChll, M.PhiGIP, M.PhiHIAP, M.PhiHIBP, M.PhiHIl, M.PhiMx8l, M.Phi3TI, M.Phi3TII 
TCGA, M.PhoHemKP, M.PhoORF39P, M.PhoORF338P, M.PhoORF584P, 
M.PhoORF905P, M.PhoORF1032P, M.PhoORF1948P, M.PIel, M.PIiMCI, 

10 M.PmaMEDHemKP, M.PmuDamP, M.PmuHemKP, M.PmuHemK2P, M.PmuHemK3P, 

M.PmuORF698P, M.PmuORF1537P, M.Ppu21l, M.ProHemKP, M.ProORFC262P, M.Psal, 
M.PshAI, M.PspGI, M.PspPI, M.Pstl, M.Pstll, M.Pvull, M.QpalP, M.RcoHemKP, 
M.RcoORF690P, M.RcoORF1350P, M.RhmlP, M.Rho11sl, M.Rhvl, M.RIe39BI, 
M.RmeADamP, M.RmeAHemKP, M.RmeAHemK2P, M.RmeAORFC243P, 

15 M.RmeAORFC246P, M.RnolP, M.RpaORFC296AP, M.RpaORFC296BP, 

M.RpaORFC298P, M.RpaORFC302P, M.RpaORFC303P, M.RprHemKP, M.Rsal, 
M.RshYP, M.RshXP, M.RspAlP, M.RspDORFC282AP, M.RspDORFC282BP, 
M.RspDORFC283P, M.RspDORFC285P, M.RspDORFC291P, M.Rsrl, M.SPBetal, M.SPRI, 
M.SacI, M.Sacll, M.Sall, M.SaplA, M.SaplB, M.Sau42l, M.Sau96l, M.Sau3AI, 

20 M.SauMu50HemKP, M.SauMu50ORF431P, M.SauMu50ORF1808P, M.SauN315HemKP, 
M.SauN3150RF391P, M.SauN3150RF1626P, M.Scal, M.SceHemKP, M.SciSpVIP, 
M.ScoA3HemKP, M.ScrFIA, M.ScrFIB, M.SenPi, M.SeqHemKP, M.SeqORFC20AP, 
M.SeqORFC20BP, M.SeqORFC57P, M.SeqORFC175P, M.SeqORFC272P, 
M.SeqORFC395P r M.SeqORFC448P, M.Sfil, M.Sfol, M.SgrAI, M.Sinl, M.Smal, M.Small, 

25 M.SmelP, M.SmeHemKIP, M.SmeHemK2P, M.SmeORF2296P, M.SmeORF3763P, 
M.SnaBI, M.SoblP. M.Spel, M.Sphl, M.Spn526IP, M.Spn5252IP, M.SpnHemKP, 
M.SpnORF505P, M.SpnORF886P, M.SpnORF1221P, M.SpnORF1336P, 
M.SpnORF1431P, M.SpnRHemKP, M.SpnRHemK2P, M.SpnRORF449P, 
M.SpnRORF790P, M.SpnRORF1101P, M.SpnRORF1287P, M.SpnRORF1665P, M.Spoml, 

3 o M.SpomHemKP, M.SprHemKP, M.SpyHemKP, M.SpyORFI 077P, M.SpyORFI 906P, 
M.Sse9l. M.SsfORF265P, M.Ssol, M.Ssoll, M.Sspl, M.Ssp6803l, M.Ssp6803HemKP, 
M.Ssp6803ORF729P, M.Ssp6803ORF1803P, M.SssI, M.Ssu2479IA, M.Ssu2479IB, 
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M.Ssu4109IA, M.Ssu4109IB, M.Ssu4961IA, M.Ssu4961IB, M.Ssu8074IA, M.Ssu8074IB, 
M.Ssu11318IAP, M.Ssu11318IBP, M.SsuDATIIA, M.SsuDATIIB, M.Sth368l, 
M.SthER35IP, M.SthSfi10RF535P, M.SthStOIP, M.SthSt8IP, M.StoHemKP, 
M.StoORF335P, M.StsI, M.StyCORFAP, M.StyEMI, M.StyDam, M.StyDcmlP, M.StyLTI, 
5 M.StyLTIII, M.StyLT2DamP GATC, M.StyLT2DcmP, M.StyLT2FelsDamP, M.SlyLTHemKP, 
M.StyLT2HemKP, M.StyLT2HemK2P, M.StyLT20RF357P, M.StyLT20RF3386P, 
M.StyLT20RF4525P, M.StyR270RF41P, M.StyR270RF43P, M.StyR270RF154P, 
M.StySBLI, M.StySBLIIP, M.StySJI, M.StySKI, M.StySPI, M.Swal, M.Taql TCGA, M.Tfil 
GAWTC, M.TfuORFC321AP, M.TfuORFC321BP, M.TfuORFC325P, M.TfuORFC327P, 
M.Thai, M.ThaHemKP, M.ThaHemK2P, M.ThaORF318P, M.ThaORF644P, 
M.ThaORF1168P, M.ThaORF1336AP, M.ThaORF1336BP, M.ThaORF1417P, M.TIil, 
M.Tmal, M.TmaHemKP, M.Tpal, M.TpaHemKP, M.Tsel, M.Tsp45l, M.Tsp509l, M.TspRI, 
M.Tth111l, M.TthHB8l, M.TvoORF124AP, M.TvoORF124BP, M.TvoORF442P, 
M.TvoORF681P, M.TvoORF725P, M.TvoORF849P, M.TvoORF1192P, M.TvoORF1400P, 
M.TvoORF1413P, M.TvoORF1416P, M.TvoORF1436P, M.UurHemKP, M.UurORF98P, 
M.UurORFIOOP, M.UurORF477P, M.UurORF528P, M.Van91ll, M.VchOIIP, M.VchADamP, 
M.VchAHemKP, M.VchAHemK2P, M.VchAORF198P, M.VchAORF1769P, M.Vspl, M.Xaml, 
M.Xbal, M.Xcml, M.Xcyl, M.XfaAORFC332P, M.XfaAORFC333P, M.XfaAORFC340P, 
M.XfaHemKP, M.XfaHemK2P, M.XfaORF297P, M.XfaORF641P, M.XfaORF935P, 
M.XfaORF1774P, M.XfaORF1804P, M.XfaORF1968P, M.XfaORF2297P, M.XfaORF2313P, 
M.XfaORF2723P, M.XfaORF2724P, M.XfaORF2728P, M.XfaORF2742P, M.Xhol, M.Xholl, 
M.Xtal, M.Xmal, M.XmaXhl, M.Xmnl, M.Xorll, M.YpelP, M.Zmal, M.ZmallA, M.Zmalll, 
M.ZmaV, and M.ZmaORFAP. 

In another methylation-based approach, analogous to the Achilles' heel 
cleavage approach described in Szybalski, Curr. Opin. Biotechnol. 8:75-81 (1997) and 
Ferrin et a/., Science 254:1494-7 (1991), single or double D-loops are formed at the target 
query region, depending on target topology; in accordance with the methods of the present 
invention, D-loops are formed with selectivity sufficient to distinguish target from variants 
that differ by as few as one nucleotide therefrom within a query region common 
therebetween. Before deproteinization, a methyltransferase, such as EcoRI methylase, is 
added. Methylation sites that are within the target query region are complexed within the 
protein-containing D-loop and are protected, from methylation. Methylation sites within 



WO 03/027640 



60 



PCT/US02/31073 



variants that differ from target by as few as one nucleotide in the common query region are 
not so protected, and are subject to methylation. 

The sample is deproteinized and contacted with a cleaving enzyme. 
Contact with a cleaving enzyme, such as a restriction endonuclease, that preferentially 
5 cleaves methylated sequences cleaves variants, not target; contact with an endonuclease 
that preferentially cleaves unmethylated DNA leads to target cleavage. 

In embodiments in which the sample is methylated, demethylase enzymes 
may be employed in the converse of the above-described approach. . 

In another approach, single or double D-loops, depending on target 
l o topology, are formed at the target query region with selectivity sufficient to discriminate 
variants that differ from target by as few as one nucleotide; the single or double D-loop are 
used to generate one or more specific cleavage sites that thus appear exclusively within the 
target. 

In these embodiments, type lis restriction endonucleases are particularly 

15 useful. 

Type lis restriction enzymes have distinct DNA binding and cleavage 
domains; therefore, they recognize a specific sequence but cleave a defined distance away, 
For example, the Type lis restriction enzyme, Fo/d, binds to a site containing the sequence 
GGATG and cleaves 9 and 13 base pairs away from the recognition site in a staggered 
20 fashion. 

Other Type lis and Type lls-like enzymes can be used including, for 
example, restriction enzyme Stsl, Group I intron homing endonuclease I-Tevl, R2 
retrotransposon endonuclease R2, P1 transposase SCEI and bacterial recombination 
RecBCD. Other homing endonucleases include, for example, F-Scel, F-Scell, F-Suvl, F- 

25 Tevl, F-Tevil, l-Amal, l-Anil, l-Bmol, l-Ceul, l-CeuAIIP, l-Chul, l-Cmoel, l-Cpal, l-Cpall, I- 
Crel, l-CrepsblP, l-CrepsbllP, l-CrepsblllP, l-CrepsblVP, l-Csml, l-Cvul, l-CvuAIP, l-Ddil, I- 
Ddill, l-Dirl, l-Dmol, l-Hmul, l-Hmull, l-HspNIP, l-Llal, I-Msol, l-Naal, l-Nanl, l-NcllP, l-NgrIP, 
l-Nitl, l-Njal, l-Nsp236IP, l-Pakl, l-PbolP, LPculP, l-PcuAl, l-PcuVI, l-PgrIP, l-PoblP, l-Porl, 
l-PorllP, l-PpblP, l-Ppol, l-SPBetalP, l-Scal, I-Scel, l-Scell, l-Scelll, l-ScelV, l-SceV, I- 

3 o SceVI, l-SceVII, l-SexlP, l-SnelP, l-SpomCP, l-SpomlP, l-SpomllP, l-SqulP, I-Ssp6803l, I- 
SthPhiJP, l-SthPhiST3P, !-SthPhiS3bP, l-TdelP, I-Tevl, l-Tevll, l-Tevlll, l-UarAP, I- 
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UarHGPAlP, l-UarHGPA13P, l-VinIP, l-ZbilP, Pl-Mgal, Pl-Mtul, PI-MtuHIP, PI-MtuHIIP, PI- 
Pful, Pl-Pfull, Pl-Pkol, Pl-Pkoll, Pl-Pspl, PI-Rma43812IP, PI-SPBetalP, Pl-Scel, Pl-Tful, Pl- 
Tfull, Pl-Thyl, Pl-Tlil, and PI-TliH. 

Type IIB restriction enzymes that cleave on both sides of the binding site 
5 may also be used, such as Bcg\ and BpH. 

In one series of embodiments, a recognition site for a Type lis, Type Ms-like 
or Type IIB restriction enzyme is formed by an extension on either or both of the first and 
second oligonucleotides; within the single or double D-loop (depending upon target 
topology), the extension folds back on the respective oligonucleotide, forming a double- 
o stranded portion containing the recognition site. 

Alternatively, a recognition site for a Type lis, Type lls-like or Type IIB 
restriction enzyme can be formed by recognition regions within the first and second 
oligonucleotides; the recognition regions are situated outside the regions of 
complementarity to target, and are complementary therebetween. 

5 In yet other approaches, a cleavage moiety or peptide having nonspecific 

endonucleolytic activity is targeted to and cleaves the target. The cleavage moiety or 
peptide may be linked directly to the first or second oligonucleotide or may be linked thereto 
through a specific binding pair interaction, such as a biotin-streptavidin interaction. 
Examples of such an endonucleolytic moiety include, for example, EDTA-Fe" (for iron/EDTA 

> facilitated cleavage), non-specific phosphodiesterases, and non-specific restriction 
endonucleases. 

In yet other approaches, the single or double D-loop may be cleaved with a 
single-strand specific endonuclease, for example, S1 nuclease, or a resolvase that 
recognizes the double D-loop structure, such as the MRE11. 

Selective cleavage of target variants by any of the above-described 
methods may be used to deplete samples of such variants, either as a complement to 
affirmative separation of the target, or alone. Selective cleavage finds particular use when 
the target and variants thereof are present within replicable vectors, and reduces the 
percentage of contaminating variants in subsequent host cell transformations. 
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Selective cleavage may also be used to facilitate selective cloning of 
targets, such as cloning of specific allelic variants of a gene of interest. 

In certain of these embodiments, at least two D-loops (either single or 
double, depending upon target topology) are formed on the target nucleic acid: the first D- 
5 loop, as described above, is formed at the query region with selectivity sufficient to 

distinguish the target from variants that differ in the common query region by as few as one 
nucleotide therefrom; the second D-loop is formed at a second site distant thereto. 

Applying one of the selective cleavage methods set forth above, cleavage 
is effected at the first site and at the second site. With formation selectively at the target 
l o query region, fragments unique to the target are obtained. 

Such unique fragments also permit the detection of targets and/or variants 
thereof by restriction fragment length polymorphism analysis. 

In another series of such embodiments, selective cleavage is used to 
prevent subsequent amplification. In embodiments in which variants are selectively 

1 5 cleaved, target can be amplified with selectivity sufficient to distinguish target from variants 
that differ by as few as one nucleotide therefrom. Amplification may be according to the D- 
loop methods above-described, or may instead be methods that do not rely upon D-loop 
formation, such as polymerase chain reaction (PCR), nucleic acid sequence-based 
amplification (NASBA), self-sustained sequence recognition (3SR), ligase chain reaction 

2 o (LCR), transcription-mediated amplification (TMA), rolling circle amplification (RCA), and 
strand displacement amplification (SDA). 

The methods of the present invention may further comprise the additional 
step of quantifying the abundance of target. Abundance may be measured as absolute 
abundance of target within a nucleic acid sample or as abundance relative to one or more 

2 5 additional targets and/or target variants. 

Absolute abundance of a target within a sample may, for example, usefully 
signal the presence or absence of an allelic variant associated with a defined phenotype, 
such as predisposition to or presence of a disease. Absolute abundance of one or more 
targets within a sample may additionally be used to categorize or identify a sample, as in 

3 o forensic applications. 
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To measure relative abundance of a target, it is useful to be able to detect 
a plurality of targets and/or variants within a single sample. 

The ability distinguishably to detect a plurality of targets (or target variants) 
. within a single sample also finds use in generating comprehensive haplotypes or genotypes 
from individual samples for prognostic, diagnostic, or monitoring purposes; in screening 
large numbers of loci in a plurality of individuals to associate genotypes with phenotypes of 
interest; and as a way to increase genotyping throughput and lower costs per interrogated 
locus. 

Accordingly, in another aspect, the invention provides a method of 
distinguishably detecting the presence of a plurality of nonsupercoiled targets within a 
sample of nucleic acids with selectivity sufficient to distinguish each of the plurality of targets 
from variants that respectively differ by as few as one nucleotide therefrom at a query region 
that is common therebetween. 

The method comprises using a recombinase to mediate formation, 
separately for each of the plurality of targets desired to be detected, of at least one 
deproteinization-stable double D loop in the target's query region, under conditions that 
favor double D loop formation at the target query region over formation at variants that differ 
from the target by as few as one nucleotide therefrom, each target's double D-loop being 
distinguishably detectable from all others of the double D-loops formed in the sample; and 
then distinguishably detecting each of the stable double-D loops so formed. 

In this aspect of the invention, conditions typically comprise contacting the 
sample, for each of the plurality of targets desired to be detected, with a first oligonucleotide 
and a second oligonucleotide. The first oligonucleotide includes a complementarity region 
that is perfectly complementary in sequence to a first strand of its respective target across 
the entirety of the target query region, the second oligonucleotide includes a 
complementarity region that is perfectly complementary in sequence to a second strand of 
the same target across at least a portion of the target query region, and either or both of the 
oligonucleotide regions is imperfectly complementary in sequence to respective first and 
second strands of the query region of each of the other targets desired discriminably to be 
detected. 
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The first oligonucleotide is bound by a recombinase and the second 
oligonucleotide comprises base modifications and does not substantially bind the 
recombinase, and at least one of the first and second oligonucleotides is distinguishable 
from the first and second oligonucleotides used to detect each of the others of the plurality 
5 of targets desired to be detected. 

Analogously, in another aspect the invention provides methods of 
distinguishably detecting the presence of a plurality of supercoiled targets within a sample of 
nucleic acids with selectivity sufficient to distinguish each of the plurality of targets from 
variants that respectively differ by as few as one nucleotide therefrom at a query region that 

1 o is common therebetween. 

The method comprises using a recombinase to mediate formation, 
separately for each of the plurality of targets desired to be detected, of at least one 
deproteinization-stable single D-loop or double D-loop in the target's query region, under 
conditions that favor single D-loop or double D-loop formation at the target query region 
1 5 over formation at variants that differ from the target by as few as one nucleotide therefrom, 
each target's single or double D-loop being distinguishably detectable from all others of the 
D-loops formed in said sample; and then distinguishably detecting each of the stable 
double-D loops so formed. 

Typically, the formation conditions comprise contacting the sample, for 

2 o each of the plurality of targets desired to be detected, with a first oligonucleotide, wherein 

the first oligonucleotide includes a complementarity region that is perfectly complementary 
in sequence to a first strand of its respective target across the entirety of the target query 
region and is imperfectly complementary in sequence to a first strand of the query region of 
each of the other targets desired discriminably to be detected. 

2 5 The first oligonucleotide is bound by a recombinase and is distinguishable 

from the first oligonucleotide used to detect each of the others of the plurality of targets 
desired to be detected. 

The method may further comprise contacting the sample, for each of the 
plurality of targets desired to be detected, with a second oligonucleotide. 

3 o For each of the plurality of targets desired to be detected, the second 

oligonucleotide comprises base modifications and does not substantially bind said 
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recombinase. The second oligonucleotide includes a complementarity region that is 
perfectly complementary in sequence to a second strand of the target across at least a 
portion of the target query region, and at least one of the first and second oligonucleotide 
complementarity regions is imperfectly complementary to the respective strand of the query 
region of each of the target variants. The second oligonucleotide is distinguishably 
detectable. 

The plurality of single or double D-ioops in the multiplexed aspects of the 
present invention may be formed concurrently or seriatim, and the D loops so formed may 
be detected concurrently or seriatim. 

Methods well suited for concurrent detection include microarray detection, 
including detection of oligonucleotide-associated barcode tags using a microarray having 
tag complements, Shoemaker era/., Nature Genet. 14(4):450-6 (1996); EP 0799897; Fan et 
a/., Genome Res. 10:853-60 (2000); and U.S. Patent No. 6,150,516, the disclosures of 
which are incorporated herein by reference in their entireties. 

As used herein, the term "microarray" and the equivalent phrase "nucleic 
acid microarray" refer to a substrate-bound collection of plural nucleic acids, hybridization to 
each of the plurality of bound nucleic acids being separately detectable. The substrate can 
be solid or porous, planar or non-planar, unitary or distributed. As so defined, the term 
"microarray" and phrase "nucleic acid microarray" include all the devices so called in 
Schena (ed.), DNA Microarrays: A Practical Apnrnar. h (Practical Ap pmarh fi^), oxford 
University Press (1999) (ISBN: 0199637768); Nature Genet. 21(1)(suppl):1 - 60 (1999); and 
Schena (ed.), Microarray Bioc hip: Tools and Technology , Eaton Publishing 
Company/BioTechniques Books Division (2000) (ISBN: 1881299376), the disclosures of 
which are incorporated herein by reference in their entireties. 

The term "microarray" and phrase "nucleic acid microarray" also include 
substrate-bound collections of plural nucleic acids in which the plurality of nucleic acids are 
distributably disposed on a plurality of beads, rather than on a unitary planar substrate, as is 
described, inter alia, in Brenner etal., Proc. Natl. Acad. Sci. USA 97(4): 166501 670 (2000), 
the disclosure of which is incorporated herein by reference in its entirety; in such case, the 
term "microarray" and phrase "nucleic acid microarray" refer to the plurality of beads in 
aggregate. The term "microarray" and phrase "nucleic acid microarray" also include 
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substrate-bound collections of plural nucleic acids in which the plurality of nucleic acids are 
incorporated within a gel, as is described, inter alia, in U.S. Patent No. 6,174,683, the 
disclosure of which is incorporated herein by reference in its entirety. 

The plurality of single or double D-loops may also be detected concurrently 
5 using mass tags and mass spectrometry, typically time of flight mass spectrometry. 

The plurality of targets to be distinguishably detected may include a 
plurality of targets that differ from one another at a query region common thereamong by as 
few as one nucleotide. The targets may differ from one another by 2, 3, 4, 5, 6, 7, 8, 9, 
even 10 nucleotides in a common query region. Such targets desired concurrently to be 
10 detected may differ in a common query region by no more than 9, 8, 7, 6 and even by no 
more than 5 nucleotides. In some embodiments, the targets desired concurrently and 
distinguishably to be detected differ in a common query region by 1, 2, 3, 4, or 5 
nucleotides. 

Such targets may be naturally-occurring allelic variants, separate members 
15 of a gene family, or recombinantly engineered variants of a single progenitor nucleic acid. 

Additionally, or in the alternative, the plurality of targets to be 
distinguishably detected may include a plurality of unrelated targets. Such targets may, for 
example, be separate and discrete loci present in a common genome. 

The plurality of targets distinguishably to be detected may include as few 
20 as 2, 3, 4, or 5 targets, and may include 10, 20, 30, 40, or even 50 or more targets. The 
plurality of targets distinguishably to be detected may include as many as 100 targets, 250 
targets, 500 targets, even as many as 1 ,000, 2,000, 3,000, 4,000, or 5,000 or more targets. 
Even as many as 10,000 to 100,000 targets may be distinguishably detected. 

As noted above, the methods of this aspect of the invention may further 
2 5 include the step of quantifying the abundance of one or more of the targets within the 
sample. 

The relative abundance of allelic variants within a population - including 
single nucleotide polymorphisms and haplotypes comprising a plurality of variants at 
multiple loci - may be used, for example, to associate genotypes with phenotypes of 
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interest. Such phenotypes include monogenic and polygenic diseases, or predispositions 
thereto. 

In a single individual, the relative abundance of a particular variant at a 
particular locus, as compared to the abundance of a target at another locus, may be used 
5 e.g. to diagnose loss of heterozygosity, and thus the presence of a clonal subpopulation of 
transformed cells in a biological sample, which can be diagnostic of the presence of a 
cancerous or precancerous lesion. 

In one series of such embodiments, the invention permits detection of a 
trace amount of DNA derived from neoplastic or preneoplastic cells in a biological sample 
l o containing a majority of wild type DNA or whole cells. 

In these embodiments, the invention comprises the comparative 
measurement of two genomic sequences. One genomic sequence is stable through 
transformation— that is, it is identical in both malignant and wild type cells in the sample. A 
second genomic sequence typically undergoes change during the course of transformation 
is - that is, it is mutated or lost during the development of malignant precursor cells. The 
relative abundance of the sequences is determined: if a statistically significant difference in 
abundance is observed, diagnosis may be made. 

Statistical methods for determining significance of abundance differences 
are described in U.S. Patent No. 5,670,325, the disclosure of which is incorporated herein 
20 by reference in its entirety. 

For detection of colorectal carcinoma and precancerous colonic lesions, the 
sample is usefully a stool sample voided by a patient. Methods for physical preparation of 
stool samples for genetic detection are disclosed in U.S. Patent Nos. 5,670,325 and 
6,303,304, the disclosures of which are incorporated herein by reference in their entireties. 

25 In other embodiments, the first and second genomic samples may be 

maternal and paternal alleles of a single locus of a heterozygous individual. A statistically 
different abundance of one of the two alleles can indicate loss of heterozygosity, and the 
presence of a clonal subpopulation of transformed cells. 

Genes for which loss of heterozygosity has been implicated in the etiology 
30 of one or more human cancers, and which are thus usefully assayed using these 
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embodiments of the methods of the present invention, include p53, DCC, APC, CDK2AP1, 
DBCCR1, DDX26, DEC, DEC1, DLC1, DLEC1, DLEU1, DLEU2, DMBT1, DNB5, DOC-1R, 
DSS1, RERE (nomenclature pursuant to CancerGene database, Infobiogen, France: 
http://caroll.vjf.cnrs.fr/cancergene/HOME.html). 

5 In another aspect, the invention provides nucleic acid compositions within 

which nucleic acid targets that differ by as few as a single nucleotide may be distinguished. 

In one series of embodiments, the nucleic acid composition is 
characterized by the presence of at least one deproteinized double D loop at a query region 
within a nucleic acid target. The deproteinized double D-loop includes a first and a second 
10 oligonucleotide. 

The first oligonucleotide includes a complementarity region that is perfectly 
complementary in sequence to a first strand of the target across the entirety of the target 
query region. The second oligonucleotide includes a complementarity region that is 
perfectly complementary in sequence to a second strand of the target across at least a 
1 5 portion of the target query region. The first oligonucleotide is bound by a recombinase, the 
second oligonucleotide comprises base modifications and does not substantially bind the 
recombinase, and at least one of said first and second oligonucleotides is distinguishably 
detectable. 

The first and second oligonucleotides have the features set forth above in 
20 the description of the methods of the present invention, which description is incorporated 
here by reference in its entirety. 

In certain embodiments, the composition further comprises at least one 
variant that differs from the target by as few as one nucleotide within a query region that is 
common therebetween; the query region of each of the target variants lacks a double D 
25 loop. 

In another aspect, the invention provides a nucleic acid composition 
characterized by the presence of a plurality of deproteinized double D-loops, each of the 
plurality formed at a query region within a respective nucleic acid target. 

Each double D-loop includes a first and a second oligonucleotide. Each 
3 o first oligonucleotide includes a complementarity region that is perfectly complementary in 
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sequence to a first strand of its respective target across the entirety of the target query 
region. Each second oligonucleotide includes a complementarity region that is perfectly 
complementary in sequence to a second strand of its respective target across at least a 
portion of the target query region. 

5 The first oligonucleotide is bound by a recombinase; the second 

oligonucleotide comprises base modifications and does not substantially bind said 
recombinase, 

In certain embodiments, at least two of the plurality of targets differ in 
sequence by at least one, and by fewer than 10 nucleotides as between their respective 

1 o • query regions, including 1 , 2, 3, 4, 5, 6, 7, 8, or even 9 nucleotides as between their 

respective query regions, with a sequence difference of 1 - 5 nucleotides being typical. In 
embodiments in which single nucleotide polymorphisms may be distinguished, at least two 
of the plurality of targets having double D-Ioops differ by exactly 1 nucleotide as between 
their respective query regions. 

1 5 The plurality of targets may include at least 1 0 targets, 1 00 targets, 1 000 

targets, even as many as 10,000 targets or more. The targets may be nonsupercoiled or 
supercoiled. 

In another aspect, the invention provides kits that are useful for performing 
the methods of the present invention. In some embodiments, the kits provide buffers; 

2 o and/or wash solutions; and/or detergents useful for removing recombinase, such as SDS; 

and/or a recombinase such as RecA, modified RecA or a yeast, human or mammalian 
analog of RecA; and/or annealing and/or incoming oligonucleotides; and/or positive and/or 
negative control samples; and/or instructions. 



25 
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The following examples are offered by way of illustration and not by 

limitation. 

EXAMPLE 1 

5 GENERAL PROTOCOLS FOR FORMATION AND 

DETECTION OF DOUBLE D-LOOPS AND Y-ARMS 

Formation of double D-loops. Typically, we form double D-loops or Y-arms 
(Y-arms are D-loops formed at the end of a duplex target; hereinafter, unless otherwise 

1 o dictated by context, the term D-loop includes Y-arms) by combining the following in a 7 pi 

reaction so that the final concentration in 10 pi is: 80 nM of the first or "incoming" 
oligonucleotide; 2.5 pM Escherichia coli RecA protein; 1 .0 mM ATP- y-S; 25 mM Tris- 
acetate, pH 6.8; 1 mM dithiothreitol; and 1 mM magnesium acetate. This reaction is 
incubated for 10 minutes at 37°C to allow for binding of RecA protein to the oligonucleotide 
1 5 ("presynapsis", see Figure 1 for an outline of the method). 

We then add double-stranded nucleic acid target, which is generally 32 P- 
end-labeled using T4 polynucleotide kinase to facilitate detection of the complex, at a 
concentration of approximately 20 nM and 10 mM magnesium acetate to a final volume of 
10 pi. We incubate this reaction for 10 minutes at 37°C to allow for synapsis between to the 

2 o incoming oligonucleotide and the target nucleic acid molecule. We then add the second or 

"annealing" oligonucleotide in 1 pi to a concentration of 640 nM (calculated for the original 
10 pi reaction volume) and incubate for 10 minutes at 37°C to allow the second 
oligonucleotide to anneal to the target nucleic acid. We then denature the RecA bound to 
the oligonucleotide:target complex by cooling the reaction to about 4°C in an ice bath and 

2 5 adding 1 pi of 10% SDS. The samples are then used immediately or stored at -20°C. 

Detection of double D-loops. We may analyze the samples prepared as 
described above by separating by polyacrylamide gel electrophoresis (PAGE). We dry the 
gels and detect the 32 P-labeled target duplex nucleic acid by either autoradiography or using 
a phosphorimager. We monitor the formation of the double D-loops under these assay 

3 o conditions by detecting the retarded migration of the labeled nucleic acid in the gel: the 



WO 03/027640 



71 



PCT/US02/31073 



labeled target nucleic acid in double D-loops migrates more slowly than duplex target 
nucleic acid. 



EXAMPLE 2 

DOUBLE D-LOOP FORMATION USING AN 
OLIGONUCLEOTIDE COMPRISING LNA 

Oligonucleotides used in this example. We employ the protocol described 
in Example 1 to form double D-loops between two DNA oligonucleotides and linear, duplex 
target DNA. The target duplex DNA is composed of two linear 70-mer oligonucleotides with 
sequence complementary to each other. The sequence of the first target strand, designated 
"OligoA", is: 5'-CTCCGGCCGCTTGGGTGGAG 

AGGCTATTCGGCTACGACTGGGCACAACAGACAATCGGCTGCTCTGATGC-3' (SEQ ID 
NO: 1) and the sequence of the second target strand, designated "OligoB", is: 3'- 
GAGGCCGGCGAACCCACC 

TCTCCGATAAGCCGATGCTGACCCGTGTTGTCTGTTAGCCGACGAGACTACG-5' (SEQ 
ID NO: 2). The nucleotide which is approximately at the center of the target sequence is 
indicated in underlined to allow for easy identification of the complementary sequence of 
subsequent DNA oligonucleotides. The first or incoming oligonucleotide, designated 
"OligoC", is a 30-mer with the following sequence: 5-AGGC 
TATTCGGCTACGACTGGGCACAACAG-3' (SEQ ID NO: 3) which is complementary to 
OligoB. The second or annealing oligonucleotide, designated "Oligol", is a 25-mer with the 
following sequence: 

5'-TTGTGCCCAGTCGTAGCCGAATAGC-3' (SEQ ID NO: 4) which is complementary to 
OligoA. In certain experiments, we use in place of Oligol as the annealing oligonucleotide 
the following 15-mer LNA oligonucleotide, designated "OligoN": 5-GCCCAGTCGTAGCCG- 
3' (SEQ ID NO: 5). 

To test the formation of Y-arms under our assay conditions, we use a 
different target duplex DNA composed of two linear 67-mer oligonucleotides with sequence 
complementary to each other. The sequence of the first target strand, designated "OligoT", 
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is: 5'-ACAACTGTGTTCACTAGCAACCTC 

AAACAGACACCATGGTGCACCTGACTCCTGAGGAGAAGTCTGC-3 , (SEQ ID NO: 6) and 
the sequence of the second target strand, designated "OligolP, is: 3'- 
TGTTGACACAAGTGATCGTTGGAG 
5 TTTGTCTGTGGTACCACGTGGACTGAGGACTCCTCTTCAGACG-5' (SEQ ID NO: 7). We 
use two oligonucleotides complementary to the end of the OligoT/OligoU linear target 
duplex in these Y-arm experiments. The first or incoming oligonucleotide, designated 
"OligoX", is a 30-mer with the following sequence: 5- 

GCAGACTTCTCCTCAGGAGTCAGGTGCACC-3' (SEQ ID NO: 8) which is complementary 
io to the end of the OligoT strand of the target duplex. The second or annealing 
oligonucleotide, designated tt Oligo5 D , is a 30-mer with the following sequence: 5- 
GTTGCACCTGACTCCTGAGGAGAAGTCTGC-3' (SEQ ID NO: 9) which is complementary 
to OligoU. 

Annealing oligonucleotide and RecA are required for formation of double D- 
15 loops. We demonstrate that both the annealing oligonucleotide and a recombination protein 
are required by following the protocol as described in Example 1 but omitting certain 
reagents or steps. The results of these experiments are shown in Figure 2 (using 
OligoA/OligoB target and OligoC/Oligol), Figure 3 (using OligoA/Oligo B target and 
OligoC/OligoN) and Figure 4 (using OligoT/OligoU target and OligoX/Oligo5). 

20 In a control experiment shown in lane 1 of these Figures, omitting both 

oligonucleotides leads to no complex formation. Similarly, when we deproteinize the 
complex after the addition of the incoming oligonucleotide and omit the annealing 
oligonucleotide, we do not observe any stable complexes (lane 2). This confirms that single 
D-loops containing DNA oligonucleotides are unstable after deproteinization. 

,2 5 Finally, as shown in lanes 3 and 4 of these figures, when we deproteinize 

the complex while adding the annealing oligonucleotide, we observe no double D-loop 
formation. This was observed regardless of whether the annealing step was carried out at 
37°C (lane 3) or 4°C (lane 4). These results indicate that the single D-loop must remain 
stabilized by the recombination protein for the annealing oligonucleotide to be incorporated 

30 into the structure. 
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To confirm this result, we perform a competition experiment where we add 
an excess of incoming oligonucleotide after the formation of the single D-loop and before 
the addition of the annealing oligonucleotide. As shown in lane 7 of Figures 2, 3 and 4, the 
addition of excess incoming oligonucleotide that has not been coated with RecA abolishes 
5 formation of double D-loops where the target sequence is in the middle of the linear duplex 
(Figures 2 and 3) and dramatically reduces the formation of Y-arms (Figure 4). By 
sequestering the annealing oligonucleotide in a hybrid with the complementary incoming 
oligonucleotide, formation of a double D-loop is inhibited. 

However, if the double D-loop or Y-arm structure is formed first by adding 
l o annealing oligonucleotide before the addition of excess free incoming oligonucleotide there 
is essentially no impairment of double D-loop formation. This result confirms that the 
annealing oligonucleotide is incorporated into the double D-loop prior to deproteinization of 
the complex. 

Oligonucleotides comprising LNA form double D-loops more efficiently. 
The results shown in Figure 2 (using OligoA/OligoB target and OligoC/Oligol), Figure 3 
(using OligoA/Oligo B target and OligoC/OligoN) and Figure 4 (using OligoT/OligoU target 
and OligoX/OligoS) lanes 5 and 6 demonstrate that using an oligonucleotide comprising 
LNA results in much greater double D-loop formation than when oligonucleotides containing 
only DNA residues are used. We observe this result both when we perform the 
deproteinization step, i.e. addition of SDS, at 4°C as in the standard protocol described in 
Example 1 (lane 6) or when we perform deproteinization at 37°C (lane 5). In addition, 
comparison of the results shown in Figure 3, lanes 5 and 6 to the results shown in Figure 2, 
lanes 5 and 6 also clearly shows that the use of an oligonucleotide comprising LNA results 
in much greater formation of double D-loops than when oligonucleotides containing only 
DNA residues are used. 

Oligonucleotides comprising LNA can form double D-loops in a single-step 
reaction. If incoming and annealing oligonucleotides are added simultaneously to the target 
duplex, double D-loop formation is severely impaired or absent. We demonstrate this using 
DNA oligonucleotides (OligoC/Oligol with OligoA/OligoB target) as shown in Figure 2, lane 9 
and lane 10. For these experiments we follow the protocol outlined in Experiment 1, except 
that we coat both oligonucleotides with RecA and then simultaneously add them in 
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equivocal amounts to the target nucleic acid. In Figure 2, lane 9 the oligonucleotides are 
combined and coated with RecA together before adding them to the target nucleic acid and 
in Figure 2, lane 10 the oligonucleotides are first separately coated with RecA and then 
added together to the target nucleic acid. As shown in Figure 4, lane 9 and lane 10, 
5 simultaneous addition of two DNA oligonucleotides also fails to support the formation of Y- 
arms. 

However, as shown in Figure 3, lane 9 and lane 10, simultaneous addition 
of two oligonucleotides where one of the oligonucleotides is composed of LNA residues 
results in surprisingly significant formation of double D-loops. The extent of double D-loop 
l o formation is greater when the oligonucleotides were separately coated with RecA before 
addition to the target nucleic acid (lane 10) than when they are mixed together before 
coating with RecA (lane 9). 



EXAMPLE 3 

15 DETERMINATION OF OPTIMAL TEMPERATURE AND ANNEALING 

TIMES FOR FORMATION OF DOUBLE D-LOOPS 



Determination of optimal annealing temperature. We test the formation of 
double D-loops using the OligoA/OligoB duplex as the target, OligoC as the incoming 

2 o oligonucleotide and Oligol as the annealing oligonucleotide (target sequence and 

oligonucleotides as described in Example 2). We follow the protocol described in Example 
1 , except we vary the temperature at which the reaction is incubated after the addition of the 
annealing oligonucleotide. As shown in Figure 5, we test incubation at 4°C, 15°C, 25°C, 
37°C and 45°C. We observe increased formation of the double D-loop as we increase the 

2 5 temperature up to approximately 37°C. In this experiment, the extent of double D-loop 
formation at 37°C is approximately equal to the extent of double D-loop formation at 45°C. 

We also determine the optimal temperature for the formation of Y-arms 
using the OligoT/OligoU duplex as the target, OligoX as the incoming oligonucleotide and 
Oligo5 as the annealing oligonucleotide. We vary the temperature as described above for 
30 the OligoA/OligoB experiment. As shown in Figure 6, we test incubation at 4°C, 15°C, 

25°C, 37°C, 45°C and 50°C. We observe almost quantitative conversion of the free duplex 
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target to oligonucleotide-containing Y-arms at all temperatures. However, as seen in the 
OligoA/OligoB experiment, increasing temperature clearly result in incremental increases in 
Y-aim formation, with optimal Y-arm formation at approximately 37°C. 

It is readily apparent to one of skill in the art that this procedure can be 
5 applied to any target nucleic acid or set of oligonucleotides to determine the annealing 
temperature that leads to optimal formation of the corresponding double D-loop or Y-arm 
structure. 

Determination of optimal annealing time. We test the formation of double 
D-loops using the OligoA/OligoB duplex as the target, OligoC as the incoming 

1 o oligonucleotide and Oligol as the annealing oligonucleotide. We follow the protocol 

described in Example 1 , except we vary the incubation time at 37°C after the addition of the 
annealing oligonucleotide. As shown in Figure 7, we test incubation for 1, 2, 3.5, 5, 7.5, 10, 
20, 30, 45 and 60 minutes. We observe that double D-loop formation occurs very rapidly 
and that the extent of double D-loop formation increases up to an incubation time of 
15 approximately 10 minutes. After 10 minutes, we observe that the proportion of target 

nucleic acid in double D-loops begins to decrease, probably due to instability of the double 
D-loop, which contained only DNA oligonucleotides. 

We determine the optimal annealing time for the formation of Y-arms using 
the OligoT/OligoU duplex as the target, OligoX as the incoming oligonucleotide and Oligo5 

2 o as the annealing oligonucleotide. We vary the annealing time as described above for the 

OligoA/OligoB experiment. As shown in Figure 8, we test incubation at 37°C for 1, 2, 3, 4, 
5, 6, 8, 10, 15 and 20 minutes. We observe almost quantitative conversion of the free 
duplex target to oligonucleotide-containing Y-arms at the one-minute time point and we do 
not observe a substantial increase in Y-arm formation with longer incubation times. In 
2 5 contrast to the OligoA/OligoB experiment, however, we do not see a reduction in the ratio of 
free duplex target to target complexed with oligonucleotides in the Y-arms, which is 
probably due to the fact that double D-loops formed at the end of a linear duplex target (i.e. 
Y-arms) are generally more stable that double D-loops formed in the middle of a linear 
duplex target. 
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It is readily apparent to one of skill in the art that this procedure can be 
applied to any target nucleic acid or set of oligonucleotides to determine the annealing time 
that leads to optimal formation of the corresponding double D-loop. 



EXAMPLE 4 

DETERMINATION OF OPTIMAL OLIGONUCLEOTIDE LENGTHS FOR 
FORMATION OF DOUBLE D-LOOPS OR Y-ARMS 



, l o Oligonucleotides used in this example. We use the OligoT/OligoU duplex 

as the target nucleic acid for these experiments. We use incoming DNA oligonucleotides 
complementary over a range of lengths to the end of the duplex target as follows: OligoV is 
a 20-mer with the sequence 5'-GCAGACTTCTCCTCAGGAGT-3' (SEQ ID NO: 10); OligoW 
is a 25-mer with the sequence 5'-GCAGACTTCTCCTCAGGAGTCAGGT-3' (SEQ ID NO: 

15 11); OligoX is a 30-mer (SEQ ID NO: 8); OligoY is a 35-mer with the sequence 5'- 

GCAGACTTCTCCTCAGGAGTCAGGTGCACCATGGT-3' (SEQ ID NO: 12); OligoZ is a 40- 
mer with the sequence 5'-GCAGACTTCTCCTCAGGAGTCAGGTGCACCATGGTGT 
CTG-3' (SEQ ID NO: 13); and Oligol is a 46-mer with the sequence 5'- 
GCAGACTTCTCCTCAGGAGTC 

2 o AGGTGCACCATGGTGTCTGTTTGAG-3' (SEQ ID NO: 14). We use annealing DNA 
oligonucleotides complementary to the end of the duplex target (and to the incoming 
oligonucleotides) over a range of lengths as follows: Oligo2 is a 20-mer with the sequence 
5'-ACTCCTGAGGAGAAGTCTGC-3' (SEQ ID NO: 15); Oligol is a 25-mer with the 
sequence 5'-ACCTGACTCCTGAGGAGAAGTCTGC-3' (SEQ ID NO: 16); Oligo5 is a 30- 

2 5 mer with the sequence 5'-GTTGCACCTGACTCCTGAGGAGAAGTCTGC-3' (SEQ ID NO: 
9); Oligo6 is a 35-mer with the sequence 5'-ACCATGGTGCACCTGAOTCCTGAGGAG 
AAGTCTGC-3' (SEQ ID NO: 17); Oligo7 is a 40-mer with the sequence 5'- 
CAGACACCATGGTGCACCT 

GACTCCTGAGGAGAAGTCTGC-3' (SEQ ID NO: 18); and Oligo8 is a 46-mer with the 
30 sequence 

5'-ACCTGACTCCTGAGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAA-3' (SEQ ID NO: 
19). 
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Oligonucleotides of different lengths form double D-loops and Y-arms. We 
use the oligonucleotides described above to form Y-arms according to the protocol detailed 
in Example 1. As shown in Figure 9, lanes 1-6, all of these oligonucleotides, ranging in size 
from 20 nucleotides to 46 nucleotides, efficiently form a Y-arm when the oligonucleotides 
5 are of equal length. This experiment does show, however, that longer oligonucleotides 
appear to form a Y-arm more efficiently. We show in Figure 10, lanes 1-5, that the 
oligonucleotides used for the formation of a Y-arm do not need to be of the same length and 
that the annealing oligonucleotide can be either longer (lane 1) or shorter (lanes 2-5) than 
the incoming oligonucleotide. 

io It is readily apparent to one of skill in the art that this procedure can be 

applied to any target nucleic acid or set of oligonucleotides to determine the length of the 
oligonucleotides that lead to optimal formation of the corresponding double D-loop. 



EXAMPLES 

15 

DETERMINATION OF OPTIMAL OLIGONUCLEOTIDE COMPOSITION 
FOR FORMATION OF DOUBLE D-LOOPS 



Oligonucleotides used in this example. We use the OligoA/OligoB duplex 
20 as the target nucleic acid for these experiments. As the incoming oligonucleotide, we use 
either OligoC (SEQ ID NO: 3) or the complementary oligonucleotide which targets the 
opposite strand of the duplex target, designated "OligoD", which has the sequence 5- 
CTGTTGTGCCCAGTCCTAGCCGAATAGCCT-3' (SEQ ID NO: 20). We then use the 
following annealing oligonucleotides: OligoE is a DNA 30-mer with the sequence 
2 5 5-AGGCTATTCGGCTACGACTGGGCACAACAG-3' (SEQ ID NO: 21); OligoF is a DNA 25- 
mer with the sequence 5'-GCTATTCGGCTACGACTGGGCACAA-3' (SEQ ID NO: 22); 
OligoG is a DNA 20-mer with the sequence S'-AnCGGCTACGACTGGGCAC-S' (SEQ ID 
NO: 23); OligoH is a DNA 30-mer with the sequence 5- 

CTGnGTGCCCAGTCCTAGCCGAATAGCCT-3' (SEQ ID NO: 24); Oligol is a DNA 25-mer 
3 0 with the sequence S'-TTGTGCCCAGTCGTAGCCGAATAGC-^SEQ ID NO: 4); OligoJ is a 
Z-O-methyl-RNA (2-OMe-RNA) 25-mer with the sequence 5- 
GCUAUUCGGCUACGACUGGGCACAA-3' (SEQ ID NO: 25); OligoK is a 2'-OMe-RNA 30- 
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mer with the sequence S'-CUGUUGUGCCCAGUCCUAGCCGAAU 

AGCCU-3* (SEQ ID NO: 26); OligoL is a 2'- O-methyl-RNA 25-mer with the sequence 5'- 

UUGUGCCCA 

GUCGUAGCCGAAUAGC-3' (SEQ ID NO: 27); OligoM is a DNA 25-mer with 
5 phosphorothioate backbone linkages with the sequence 5'- 

TTGTGCCCAGTCGTAGCCGAATAGC-3' (SEQ ID NO: 28); OligoN is an LNA 15-mer 
(SEQ ID NO: 5); OligoO is a LNA-DNA-LNA 15-mer with the sequence 5-GCCCagtcgtaG 
CCG-3', where the LNA residues are indicated with capital letters and the DNA residues are 
in lowercase (SEQ ID NO: 29); OligoP is an LNA-DNA-LNA 25-mer with the sequence 5- 

l o TTGtgcccagtcgtagccgaatA 

GC-3',where the LNA residues are indicated with capital letters and the DNA residues are in 
lowercase (SEQ ID NO: 30); OligoQ is a PNA 18-mer with the sequence lys- 
ACGGGTCAGGATCGGCTT-gly (SEQ ID NO: 31); OligoR is a PNA 18-mer with the 
sequence lys-ACGGGTCAGCATCGGCTT-gly (SEQ ID NO: 32); OligoS is a PNA 20-mer 

1 5 with the sequence Ac-E-GTGCCCAGTCCTAGCCGAAT-E-NH 2 (SEQ ID NO: 33). All of the 
oligonucleotides (or PNAs) are completely complementary to the target sequence except 
OligoH, OligoK, OligoQ and OligoS which each have a single basepair mismatch. 

We show in Figure 11 the formation of double D-loops using OligoA/OligoB 
as the target nucleic acid, OligoC as the incoming oligonucleotide and the oligonucleotides 

20 as the annealing oligonucleotide as indicated in the Figure. From these data it is apparent 
that oligonucleotides with a mismatched base can form a double D-loop (lanes 2 and 3) and 
that when we use 2'-OMe-RNA oligonucleotides (lanes 3 and 5), PNA (lane 7) and LNA 
oligonucleotides (lanes 8-10) as annealing oligonucleotides the formation of double D-loops 
is more robust that with DNA oligonucleotides. We also find that annealing oligo- 

2 5 nucleotides containing phosphorothioate modifications do not function as well as DNA for 
the formation of double D-loops (lane 6). Finally, this experiment confirms that 
oligonucleotides that are partially modified still enhance double D-loop formation relative to 
a DNA oligonucleotide (lanes 8 and 10). 

We confirm these results using the OligoA/OligoB target nucleic acid, 
3 o OligoD as the incoming oligonucleotide, which recognizes the opposite strand of the 
OligoA/OligoB target nucleic acid relative to the previous experiment, and annealing 
oligonucleotides as indicated in Figure 12. These data confirm that an oligonucleotide with 
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a mismatched base can form a double D-loop (lane 6) and that when we use 2'-OMe-RNA 
oligonucleotides (lane 5) and PNA (lanes 6 and 7) as annealing oligonucleotides the 
formation of double D-loops is more robust that with DNA oligonucleotides. Combined with 
the previous experiment, these data also show that the double D-loop can be formed when 
5 the incoming oligonucleotide recognizes either strand of the target duplex. 

It is readily apparent to one of skill in the art that this procedure can be 
applied to any target nucleic acid or set of oligonucleotides to determine the composition of 
the oligonucleotides that lead to optimal formation of the corresponding double D-loop and 
that a wide range of oligonucleotides functions in the methods of the invention. 

.0 

EXAMPLE 6 

OLIGONUCLEOTIDETARGET CAPTURE AND DNA DETECTION 

5 Oligonucleotides used in this example. We use the OligoA/OligoB duplex 

as the target nucleic acid for these experiments. We use a 32 P-Iabeled DNA oligonucleotide 
as the incoming oligonucleotide. We then use a biotin-Iabeled annealing oligonucleotide 
comprising at least one modified backbone that enhances hybrid stability or a modified base 
that enhances hybrid stability. 

o The capture/detection assay. We assay the presence of two 

oligonucleotides (one biotin-Iabeled and the other 32 P-labeled) on the OligoA/OligoB duplex 
target molecule by capturing biotin-containing-o!igonucleotide:target double D-loops on 
streptavidin-coated paramagnetic beads. The beads are washed in IxRecA reaction buffer 
(1.0 mM ATP-y-S; 25 mM Tris-acetate, pH 6.8; 1 mM dithiothreitol; and 1 mM magnesium 

5 acetate), 10xRecA reaction buffer, and finally in IxRecA reaction buffer. Before DNA 
capture, equal aliquots of washed beads are added to individual 1.5 ml microcentrifuge 
tubes and the final wash buffer is removed. Liquid is removed from all bead suspensions by 
placing microcentrifuge tubes containing the bead mixtures in a magnetic separating rack. 

The double D-loop containing samples from above are each added to a 
o microcentrifuge tube containing an aliquot of the washed paramagnetic beads. The samples 
are mixed, and incubated at room temperature for 15 min. The mixtures are shaken several 
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times during incubation to ensure efficient biotinrstreptavidin interaction. After the capture 
reaction, i.e., the binding of streptavidin to biotin, the paramagnetic beads in each reaction 
are amassed with a magnet and the reaction buffer removed. 

Each sample of beads is washed three times with IxRecA reaction buffer. 
5 The presence of 32 P-labeled probe strand is assessed by scintillation counting of the DNA 
captured by each bead reaction. 

The results indicate that the hybridization product, containing two 
complementary but differentially labeled oligonucleotides, can be captured using the 
streptavidin interaction with the biotin labeled probe strand and subsequently detected by a 
l o label in the complementary probe strand. 



EXAMPLE 7 

RECA+ FACILITATED DNA AMPLIFICATION 
15 WITHOUT TARGET DNA DENATURATION 



Reaction conditions for RecA protein facilitated DNA amplification have 
been described in United States Patent No. 5,223,414, incorporated herein by reference in 
its entirety. 

20 We use a double-stranded duplex DNA target derived from plasmid DNA 

and two sets of oligonucleotides that form double D-loops at discrete sites separated by at 
least 200 nucleotides for ease of detection. We ensure that elongation of DNA primers 
occurs in only the desired direction, by terminating the 3-ends of the appropriate primers 
with 2',3'-dideoxynucleotide, which lacks the 3-hydroxyl group present in the conventional 

2 5 dNTPs and essential for elongation therefrom. We add the dideoxynucleotide to the primer 

using the enzyme terminal deoxynucleotide transferase. 

We form double D-loops in the target nucleic acid using the two sets of 
oligonucleotides described above and the method described in Example 1. We then use 
the resulting two sets of double D-loops as the substrate in a typical DNA amplification 

3 o reaction. The DNA reaction can be carried out in buffer containing 1 0 mM Tris-HCI (pH 7.5), 

8-12 mM MgCI 2) and 50 mM NaCI supplemented with 200-750 \iM dNTPs and DNA 
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polymerase (e.g., exonuclease-free, DNA polymerase I, Klenow, or 17 DNA polymerase). 
The reaction may additionally be supplemented with other enzymes or proteins (e.g. DNA 
heiicase, DNA ligase and SSB protein) which may facilitate the formation of the specific 
amplification product. The reaction is allowed to proceed for as long as necessary at 37°C. 
5 Upon termination, samples are optionally deproteinized and analyzed by gel 

electrophoresis. After electrophoretic separation, the resulting amplified DNA can be 
visualized by either ethidium bromide staining of the DNA in the gel or by DNA hybridization 
with a target specific DNA probe. Alternatively, one of the DNA oligonucleotides can be 
biotinylated and the newly synthesized DNA captured by appropriate means and then 

1 o detected as previously described. 

DNA synthesis reactions are initiated by the addition of 1-2 unit(s) of 
exonuclease-free E. coli DNA polymerase I (U.S. Biochemicals) and 750 pM of each dNTP. 
The reactions are incubated at 37°C. 

Following the initial addition of polymerase, the reactions can be 
15 supplemented with 1 unit of e.g., Klenow and/or additional dNTPs, at specific intervals 
spaced over the time course of the reaction. 

Samples are treated with proteinase K, before being loaded for 
electrophoretic separation. After electrophoretic separation the resulting amplified DNA 
fragments can be visualized by either ethidium bromide staining of the gel or by 

2 o • hybridization with a target specific probe. 

For hybridization analysis the gel can be transferred by standard protocols 
onto hybridization transfer membrane. We then detect the DNA using end-labeled probe 
corresponding to the DNA sequence of the target nucleic acid internal to the two double D- 
loops. We then detect hybridization signal by autoradiography or using a phosphorimager. 
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EXAMPLE 8 

IN SITU DNA DETECTION UTILIZING THE DOUBLE D-LOOP REACTIONS 

5 Preparation of oligonucleotide complex. We design oligonucleotides to 

form a double D-loop in a target nucleic acid. One of these oligonucleotides comprises LNA 
and one of these oligonucleotides comprises a detectable fluorophore. 

Preparation and transformation ofHeLa cells. We grow HeLa cells at 37°C 
and 5% C0 2 in a humidified incubator to a density of 2x1 0 5 cells/ml in an 8 chamber slide 

10 (Lab-Tek). We replace the DMEM with Optimem and transfect the cells with 5 pg of RecA- 
coated oligonucleotides that are previously complexed to 10 pg lipofectamine according to 
manufacturer's directions (Life Technologies). We treat the cells with the liposome, 
oligonucleotide mix for 6 hours at 37°C. We wash the treated cells with PBS and add fresh 
DMEM. After a 16-18 hour recovery period we assay the cells for fluorescence indicative of 

15 formation of the double D-loop. Specific signals are detected using standard fluorescence 
microscopy observation techniques. 

EXAMPLE 9 

2 0 RECA MEDIATED DOUBLE D-LOOP HYBRIDIZATION REACTIONS 

USING A VARIETY OF COFACTORS 



Oligonucleotides used in this example. We use the 32 P-labeled 
OligoA/OligoB duplex as the target nucleic acid for these experiments. We use the DNA 

2 5 oligonucleotide OligoC as the incoming oligonucleotide and OligoN as the annealing 

oligonucleotide. 

Double-D-loops can be formed using different cofactors for the RecA 
protein. We use the above-mentioned oligonucleotides to form double D-loops according to 
Example 1 except we substitute rATP, dATP or GTP- y-S for ATP- y-S in the RecA coating 

3 o reaction. These reactions are performed with or without a regenerating system. The double 

D-loops are then deproteinized and detected as described previously. 
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EXAMPLE 10 

DOUBLE D-LO0P FORMATION OCCURS UNDER A RANGE OF CONDITIONS 

5 

We test the ability of the double D-loop formation reaction to tolerate 
variations in reagent concentrations. We form double D-loops by combining 1 .1 p| of 
fluorescently labeled incoming oligonucleotide LDF/45G (5'-Cy™5- 

GGTGGAGAGGCTATTCGGCTAGGACTGGGCACAACAGA 

10 CAATCGG-3'; SEQ ID NO: 34), 3 pi of 5x Synaptic Buffer (125 mM Tris-acetate, 5 mM 
Mg(acetate) 2 and 5 mM DTT), 1.5 ul 10 mM ATP-y-S, water and 73.5 pM RecA. We vary 
the concentration of incoming oligonucleotide in the 1 .1 ul sample using 2.25 pM, 4.5 pM, 9 
uM or 18 pM. We vary the concentration of RecA relative to the concentration of the 
incoming oligonucleotide in the mixture, e.g. we add 0.5 pi RecA (73.5 pM) to the mixture 

15 when the concentration of the incoming oligonucleotide is 2.25 pM, 1.0 pi RecA when the 
concentration is 4.5 pM, and so on. Prior to the addition of RecA, water is added to the 
reaction mixture so that the final volume after the addition of RecA is 14 pi. We incubate 
this reaction for 10 minutes at 37°C to allow for binding of RecA protein to the 
oligonucleotide ("presynapsis", see Figure 1 for an outline of the method). 

We prepare a double-stranded target by PCR from a neomycin 
phosphotransferase gene with a point mutation (Kan-) using two oligonucleotide primers: 
3910U (5'-CAGGGGATCAAGATCTGAT-3'; SEQ ID NO: 35) and 3CGT«" (5'- 
GCTTCAGTGACAACGTCGAG-3'; SEQ ID NO: 36). The sequence of the resulting PCR 
product is shown in Figure 13 (SEQ ID NO: 37). We add 3.5 pi of the PCR product at a 
25 concentration of 0.7 pM and 2.5 pi 74 mM Mg(acetate) 2 . We incubate this reaction for 10 
minutes at 37°C to allow synapsis between the incoming oligonucleotide and the target 
nucleic acid molecule. We then add 0.74 pi of 27 pM annealing oligonucleotide KL02 which 
comprises LNA modified residues 

(5'-GCCCAGTCGTAGCCG-3'; SEQ ID NO: 38). We incubate this reaction for 5 minutes at 
37°C to allow the annealing oligonucleotide to anneal to the target nucleic acid. We then 
stop the reaction by placing briefly on dry ice. We denature the RecA bound to the 
oligonucleotide:target complex by placing the reaction at about 4°C in an ice bath and 
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adding 2 pi of 10% SDS and 2 pi of 10x loading dye (15-25% Ficoll, optionally 
supplemented with 0.05% bromophenol blue). 

We analyze the samples prepared as described above by separating on a 
2.5% agarose gel at 4°C. The gel does not contain ethidium bromide. We strain the gels 
5 after running then with 1x SYBR® green, a dye which binds double-stranded DNA, and scan 
on a Typhoon™ imager, We monitor the gel positions of the double D-loop and the double- 
stranded target DNA by detecting the SYBR® green dye. We monitor the formation of 
double D-loops under these assay conditions by detecting the retarded migration of the 
fluorescently labeled incoming oligonucleotide. 

10 We observe that double D-loops are formed at all incoming oligonucleotide 

concentrations. The fraction of target molecules that are in double D-loops at the different 
incoming oligonucleotide concentrations is 33% with 2.25 pM oligonucleotide; 37% with 4.5 
pM oligonucleotide; 39% with 9 pM oligonucleotide; and 42% with 18 pM oligonucleotide. 
This indicates that the efficiency of double D-loop formation varies depending on the 

15 incoming oligonucleotide concentration. These results also indicate that double D-loop 
formation occurs over a wide range of oligonucleotide concentrations. Based on these 
results, unless indicated otherwise, the reaction in the following examples uses 1.11 pi of 18 
pM incoming oligonucleotide and 2.0 pi of RecA. 



20 EXAMPLE 11 

EFFECT OF VARYING ANNEALING OLIGONUCLEOTIDE COMPOSITION AND TARGET 
SEQUENCE ON DOUBLE D-LOOP FORMATION 



25 We test the effect of varying the length, composition and sequence of the 

annealing oligonucleotide and the sequence of the target nucleic acid molecule on double 
D-loop formation. 

We form double D-loops using the Kan- double-stranded PCR product as 
the target nucleic acid molecule following the protocol described in Example 10 except that 
3 0 we incubate the reaction after adding the annealing oligonucleotide for 1 0 minutes at 37°C. 
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10 



We analyze the samples prepared by separating on a 2.5% agarose gel at 4°C as 
described above. 

We test annealing oligonucleotides comprising PNA; DNA with a 
phosphorothioate backbone; Z-O-methyl RNA; and LNA as indicated in Table 1. As 
indicated in Table 1, we test annealing oligonucleotides of various lengths. We normalize 
the percentage of double D-loop formation relative to the percentage of double D-loop 
formation using KL02 as the annealing oligonucleotide to calculate relative efficiency. As 
indicated in Figure 14, we observe that stable double D-loop formation occurs using any of 
the test annealing oligonucleotides. In this reaction, we observe that double D-loop 
formation is most efficient when oligonucleotides comprising PNA are used in the reaction. 
As indicated by the error bars in Figure 14, the variation in the efficiency of double D-loop 
formation is very low, indicating that double D-loop formation can be used in quantitative as 
well as qualitative applications. 





Table 1 

Annealing Oligonucleotides for Kan- Target 


Name 


composition* 


Sequence ~ 


SEQ 
ID 

NO: 


KM2 


PNA 


Ac-E-GTGCCCAG I CCTAGCCGAAT-E-NH 2 ' 


39 


UDS15G 


DNA with 

phosphorothioate 

backbone 


5'-GCCCAG'rCGTAGCCG-3' 


40 


UR15G 


2"-0-MeRNA 


5'-GCCCAGUCGUAGCCG-3' 


41 


KL02 


LNA 


5-GCCCAGTCGTAGCCG-3' 


42 


KL06 


LNA 


5'-CCCAGTCGTAGCC-3' 


43 


KL015 

*Th 


LNA 

PS linHprlinoH raciHnac in 


5'-GTGCCCAGTCGTAGCCGAAT-3' 

t/l rnc rvKi a ,i" : — : ^— 


44 



"The underlined residues in KL015 are UNA, the remainder are LNA 



We also form double D-loops using a PCR product produced from a 
plasmid containing the a functional hygromycin resistance gene (Hyg+) or a hygromycin 
resistance gene containing a point mutation (Hyg-; Figure 15; SEQ ID N0:_J using 
primers AUR123f (5'-TCTGCACAATATTTCAAGC-3'; SEQ ID NO: 45) and Hyg1560r (5'- 
AAATCAGCCATGTAGTG-3'; SEQ ID NO: 46). We follow the protocol described in this 
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Example for the formation of double D-loops in the Kan- PGR product using HygUDF/45G 
as the incoming oligonucleotide (5'-Cy™5-CGCAGCTATTTACCCGCAGGACCTATCCA 
CGCCCTCCTACATCGA-3'; SEQ ID NO: 47) and various annealing oligonucleotides as 
indicated in Table 2. We analyze the samples prepared by separating on a 2.5% agarose 
5 gel at 4°C as described above. 

We test annealing oligonucleotides comprising LNA as indicated in Table 2. 
As indicated in Table 2, we also test annealing oligonucleotides of various lengths! We 
normalize the percentage of double D-loop formation relative to the percentage of double ID- 
loop formation using Hyg15T as the annealing oligonucleotide to calculate relative 
l o efficiency. Asjndicated in Figure 16, we observe that stable double D-loop formation occurs 
using any of the test annealing oligonucleotides. In this reaction, we observe that double D- 
loop formation is most efficient when we use the longest of the test oligonucleotides. As 
indicated by the error bars in Figure 16, the variation in the efficiency of double D-loop 
formation in this reaction is also very low. 

15 



Table 2 

Annealing Oligonucleotides for Hyg- Target 


Name 


Composition* 


Sequence 


SEQ ID 
NO: 


Hyg4 


LNA 


5'-GGATAGGJCC-3' 


48 


Hyg6 


LNA 


5'-TGGATAGGTCCT-3' 


49 


Hyg7 


LNA 


5'-GTGGATAGGTCCTGC-3' 


50 


Hyg15 


LNA 


5'-GTGGATAGGTCCTGC-3' 


51 


*T 


tie underlined residues in these oligonucleotides are DNA, the remainder are 



LNA. 



EXAMPLE 12 

20 

DOUBLE D-LOOP FORMATION IS SEQUENCE-SPECIFIC 



We test the effect of heterologous nucleic acid molecules on the efficiency 
of double D-loop formation. We form double D-loops using the Kan- double-stranded DNA 
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tanget as described in Example 10 except that we add heterologous competitor nucleic acid 
molecules along with the Kan- target and Mg(acetate) 2 . We perform the competition 
experiments using two competing nucleic acid molecules which do not have significant 
sequence homology with the Kan- target or either the incoming or annealing 
oligonucleotides: the Hyg- PCR product described above (Figure 15; SEQ ID NO:_) and 
poly dl-dC (Sigma). 

We first add the Hyg- PCR fragment to the mixture in a 1:1, 5:1 and 10:1 
molar ratio relative to the amount of the Kan- target nucleic acid molecule. We observe that 
the addition of the non-specific Hyg- PCR fragment has no noticeable effect on the 
efficiency of double D-loop formation. We then add the poly dl-dC non-specific competitor 
nucleic acid molecule in vast excess (10S 10*-, 10*-, 10*- and 105-fold excess over the 
amount of the Kan- target nucleic acid molecule). Even with such a vast excess of 
competitor, we observe, no noticeable effect on the efficiency of double D-loop formation. In 
the presence of 105-fold excess of the poly dl-dC non-specific competitor, we observed 
approximately 50% efficiency of double D-loop formation and the concentration of the Kan- 
target nucleic acid molecule was limiting for double D-loop formation. These results indicate 
that double D-loop formation is sequence specific and that a nucleic acid molecule that 
represents a very small fraction of the nucleic acid molecules in a reaction serves as the 
target for double D-loop formation. 



EXAMPLE 13 

DOUBLE D-LOOP FORMATION IN A LINEARIZED PLASM ID TARGET 

We test the efficiency of formation of double D-loops in a large plasmid 
target. We form double D-loops as described in Example 10 except that we add 2.5 pi (0.5 
pg) of a linearized 8.2 kb plasmid comprising the Kan- target gene and 7.35 pi of water. We 
add additional water to the reaction because of the higher target concentration. We then 
monitor the formation of double D-loops in the target plasmid using different annealing 
oligonucleotides (KM2, KL02, KL06, and KL015; Table 1). We analyze the samples 
prepared by separating on a 1 % agarose gel at 4°C as described above and detect 
formation of the double D-loop by monitoring colocalization of the SYBR® green and the 
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Cy™5 marker on the incoming oligonucleotide. We can not accurately assess the efficiency 
of double D-loop formation in these reactions because the formation of the double D-loop 
does not produce a large enough mobility shift in the target. 

We observe efficient double D-loop formation in the plasmid target with any 
5 of the four annealing oligonucleotides. In contrast to our observations with a smaller Kan- 
nucleic acid target molecule (Example 11; Figure 14), we observe that the a shorter, 15mer 
oligonucleotide (KL02; SEQ ID NO: 38) forms double D-loops at a slightly greater efficiency 
than a 20mer oligonucleotide (KL01 5; SEQ ID NO: 43). In addition, we observe little or no 
difference in the apparent efficiency of double D-loop formation when we use an annealing 
l o oligonucleotide comprising PNA or LNA. These results indicate that the size of the target 
does not significantly affect the efficiency of double D-loop formation. 



EXAMPLE 14 

15 TOPOISOMERASE I ENHANCES DOUBLE D-LOOP FORMATION IN A SUPERCOILED 

PLASMID TARGET 



We test the effect of adding topoisomerase I on the formation of double D- 
loops in a large, supercoiled plasmid target. We form double D-loops as described in 
2 o Example 1 3 except that the plasmid comprising the Kan- target gene is supercoiled and we 
add various amounts of topoisomerase I (0.5, 1 .0 and 1 .5 units. (One unit of topoisomerase 
I relaxes completely 0.5 jig of plasmid in 30 minutes at 37°C.) to the reaction along with the 
target and we increase the incubation during that step to 30 minutes at 37°C. We then 
monitor the formation of double D-loops in the target plasmid using different annealing 

2 5 oligonucleotides (KM2 and KL02; Table 1). We analyze the samples by separating on a 

1% agarose gel at 4°C as described above and detect formation of the double D-loop by 
monitoring colocalization of the bound SYBR® green marker and the Cy™5 marker on the 
incoming oligonucleotide. 

We observe efficient double D-ioop formation in the supercoiled plasmid 

3 o target with either annealing oligonucleotide in both the presence and absence of 

topoisomerase I. We observe that increasing the amount of topoisomerase I increases the 
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amount of double D-loop formation. Both LNA and PNA support comparable levels of 
double D-loop formation in supercoiled target nucleic acid. The reactions in which the 
double D-loops are formed with PNA have less background than reactions using LNA. 
These results indicate that double D-loops can be formed in a supercoiled target following 
the teachings of the instant invention. Further, topoisomerase I enhances, but is not 
essential for, double D-loop formation in a supercoiled target. 



EXAMPLE 15 

PURIFICATION OF NUCLEIC ACID MOLECULES USING 
DOUBLE D-LOOPS 



We test whether the sequence specificity of double D-loop formation can 
be used for purification of a nucleic acid molecule of defined sequence from a complex 
mixture. We form double D-loops as described in Example 14 with the following exceptions: 
we use LDB/45G at a concentration of 18 uM and 5 units of topoisomerase I; the target 
nucleic acid is a 1:1 mixture of supercoiled pBR322 (Ap R , Tet R ) and the supercoiled Kan* 
plasmid used in Example 14; and the reaction is incubated for 1 hour at 37°C after addition 
of the target and topoisomerase I. LDB/45G has the same sequence as LDF/45G (used in 
Example 10), except that LDB/45G is not labeled with Cy™5 and has a biotin molecule 
attached at the 3' end by a TEG linker (5'- 

GGTGGAGAGGCTATTCGGCTAGGACTGGGCACAACAGACAA 
TCGG-3'bioTEG; SEQ ID NO: 52). Neither the incoming oligonucleotide (LDF/45G) or the 
annealing oligonucleotide (KL02) has significant sequence complementarity to any 
sequence in pBR322. 

We denature the RecA by cooling the reaction to about 4°C by placing it in 
an ice bath and adding 2 pi 10% SDS as above. We then add KCI to a final concentration 
of 100 mM to precipitate the SDS, spin at 5000 rpm for 5 minutes in a microcentrifuge to 
pellet the precipitated SDS and transfer the supernatant to another tube. We add 2 pi of 
Dynabeads™ diluted in 1x Synaptic Buffer and incubate the reaction for 2 hours at 4°C with 
vertical rotation to mix the solution. We then separate the magnetic Dynabeads™ by placing 
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in a magnetic tube holder for 5 minutes and remove the supernatant. We wash twice in 1x 
TE buffer (GibcoBRL) by suspending the pellets in 50 pi ice-cold TE and agitating for 5 
minutes at 4°C, separating the magnetic Dynabeads™ by placing in a magnetic tube holder 
for 5 minutes at 4°C and removing the supernatant. We elute the plasmid DNA from the 
5 Dynabeads™ by adding 10 pi TE and heating the solution at 65°C for 15 minutes. We 
collect the eluant and mix 5 pi with 20 pi electrocompetent DH10B cells. We electroporate 
the sample in a Cell-Porator® set at 330 pF, 4 Q and 400 V. We remove the cells and place 
them in 1 ml of SOC medium and allow them to recover by incubating at 37°C for 1 hour. 
We then dilute the cells into 4 ml of SOC medium and incubate on a shaker at 37°C for 2 
l o hours. We spin the cells down in a table top centrifuge for 5 minutes at 3750 rpm and 
resuspend the cells in 750 pi LB. We dilute these cells 1:10 into fresh LB and plate 1 00 pi 
of cells on LB plates supplemented with either 10 pg/ml tetracycline or 20 pg/ml kanamycin. 
We incubate the plates at 37°C overnight and count the colonies after approximately 16 
hours. 

15 We observe extremely clean purification of the target plasmid. See Table 

3. We are unable to determine the efficiency of purification because we observe no 
background colonies. These results indicate that the double D-loops formed according to 
the methods of the invention can be used to purify a nucleic acid molecule of known 
sequence away from other nucleic acid molecules. 

20 



Table 3 

Separation of Kan R plasmid from pBR322 using 


double D-loops 


Sample 


Tef Colonies 


Kan R Colonies 


No Dynabeads™ 


>5000 


>5000 


No annealing oligonucleotide 


0 


0 


No RecA 


0 


6 


Complete reaction 


0 


163 



We also test whether the sequence specificity of double D-loop formation 
can be used for purification of a large nucleic acid molecule, for example a YAC. We 
inoculate a 5 ml culture of growth medium with a single colony from a YAC-containing strain 
25 of yeast and allow it to grown overnight until saturated. The following day we inoculate an 
additional 100 ml culture of growth medium with 1 ml of the overnight starter culture and 
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grow this culture overnight until saturated. We use a hemocytometer to determine the cell 
count, which is generally about 1 x 10 8 cells/ml. We harvest the cells by centrifugation at 
1300 x g for 5 minutes and wash the pellet twice with 50 mM EDTA pelleting between 
washes for 5 minutes at 1300 x g. We resuspend the cells in 50 mM EDTA to a 
5 concentration of 2 x 1 0 9 cells/ml and warm the cell suspension to 45°C for 5 minutes. We 
add an equal volume of 1% InCert agarose in 50 mM EDTA, also prewarmed to 45°C. 
Alternatively, we use 1% or 2% SeaPlaque agarose. We mix the suspension by vortexing 
and pipet 500 \\\ aliquots into an agarose plug mold to harden. A 100 ml culture will yield 
about 20 plugs. We allow the plugs to set at room temperature or at 4°C which takes about 
10 15 minutes. 

We extrude each plug into a dish and add 6 ml of freshly prepared yeast 
spheroplasting solution (40 ml 1 M sorbitol; 1.6 ml 0.5 M EDTA, pH 8.0; 0.4 ml 1 M Tris-HCI, 
pH 7.5; 40 \)\ 2-mercaptoethanol; and 40 mg yeast lytic enzyme (ICN)). We incubate the 
plugs at 37°C for 2 to 4 hours with gentle shaking. We aspirate off the spheroplast solution, 

15 add 6 ml of LDS solution (1 % lithium dodecyl sulfate; 100 mM EDTA; and 1 0 mM Tris-HCI, 
pH 8.0) and incubate at 37°C with gentle shaking for 1 hour. We remove the solution and 
add 6 ml fresh LDS solution and incubate with gentle shaking at 37°C overnight. We wash 
the plugs three times with gentle shaking at room temperature for 30 minutes with 6 ml 0.2 x 
NDS (1x NDS is 0.5 M EDTA; 10 mM Tris base; 1% Sarkosyl; pH 9.5). We then wash the 

2 o . plugs five times with gentle shaking at room temperature for 30 minutes with 6 ml TE, pH 
8.0. Plugs are either used directly or stored at 4°C in covered with TE, pH 8.0. 

We form a double D-loop in the YAC DNA in the agarose plug. We coat an 
incoming biotinylated oligonucleotide with RecA using a 5x amount of reactants as 
described in Example 10. We soak the plug in the solution containing the RecA-coated 

2 5 incoming oligonucleotide at 37°C for between 20 minutes and 2 hours. We then add 5x 

volume of annealing oligonucleotide to the plug and soak for an additional 10 minutes to 1 
hour. We insert the plug into a pulse field electrophoresis gel containing low melt agarose 
and a strip of conjugated agarose-streptavidin. We run the pulse field gel such that the 
DNA migrates across the streptavidin containing band allowing the YAC DNA containing the 

3 o biotinylated double-D loop to be captured by the band. We excise the strip, heat it to melt 
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the agarose, and elute the target from the band. We then transform spheroplasted yeast 
cells with the eluant by lithium acetate transformation. 

EXAMPLE 16 

5 

DOUBLE D-LOOP HYBRIDIZATION REACTIONS CAN DISCRIMINATE 
SINGLE BASEPAIR DIFFERENCES IN TARGET SEQUENCES 

We test whether the sequence specificity of double D-loop formation can 
10 be used to discriminate single basepair differences in a target sequence. We form the 
double D-loop as described in Example 10 except that we use a 50:50 mixture of two 
incoming oligonucleotides. 

For example, we use 0.55 pi of an 18 pM solution of LDF/31G (5'-Cy™5- 

GAGGCTATT 

1 5 CGGCTAGGACTGGGCACAACAG-3'; SEQ ID NO: 53) and 0.55 pi of an 1 8 pM solution of 
LDF/31C 

(5'-Cy™3- GAGGCTATTCGGCTACGACTGGGCACAACAG -3'; SEQ ID NO: 54). The 
incoming oligonucleotide LDF/31G is complementary to the sequence of the mutant Kan R 
gene with the nucleotide corresponding to the point mutation centrally positioned and the 

2 o incoming oligonucleotide LDF/31 C is fully complementary to the sequence of a functional 
Kan R gene. The RecA-coated mixture of incoming oligonucleotides is added separately to 
either the Kan- or the Kan + PCR product. We add KM2 (SEQ ID NO: 39) as the annealing 
oligonucleotide. The KM2 oligonucleotide is perfectly complementary to the Kan- target 
sequence. We also perform this experiment with an individual annealing oligonucleotide 

2 5 specific for the Kan + target sequence and with a mixture of the two oligonucleotides. We 
perform these experiments with annealing oligonucleotides comprising a variety of 
combinations of modfied backbones or bases, including, for example, LNA, PNA, 2-0- 
methyl RNA and 2-aminoadenine or cytosine/uracil substituted at the 5 position with a 
methyl, propynyl or bromo group. 

30 We test the stability of the double D-loops formed as described above by 

denaturing the RecA bound to the oligonucleotide:target complex by adding SDS and, 
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10 



15 



20 



25 



30 



optionally KCI, and heating the samples to various temperature, e.g. 37°C, for varying 
periods of time. We then analyze the samples by separating by agarose gel 
electrophoresis. We monitor the stability of the double D-loops under these assay 
conditions by detecting the fluorescent labels on the Cy™3- and Cy™5-labeled 
oligonucleotides. The migration of these labeled oligonucleotides is retarded when they are 
part of a double D-loop complex. 

We observe that double D-loop complexes in which the labeled incoming 
oligonucleotide is mismatched to the template are significantly less stable after denaturing 
the RecA than complexes in which the oligonucleotide is perfectly complementary to the 
template. This difference is readily detectable and after only 2.5 minutes at 37°C a double 
D-loop made with a mismatched annealing oligonucleotide is almost completely 
undetectable. Accordingly, it is possible to determine which target sequence is in a sample 
based on which fluorescently labeled incoming oligonucleotide is present in the complex. 
This result indicates that stable double D-loops may be used to detect a single-nucleotide 
polymorphism in a target sequence or a mixture of target sequences. 

EXAMPLE 17 

c l ?S??n E . D " L00P HY BRIDIZATION REACTIONS CAN DISCRIMINATE 
SINGLE BASEPAIR DIFFERENCES IN A GENOMIC TARGET SEQUENCE 

We test the effect of varying the annealing oligo sequence to double D-loop 
formation efficiency in genomic DNA. We form double D-loops as in Example 16 using a 
50:50 mixture of HYG(NT)D5Cy5/31C(rep) (5'-Cy™5- 

ATnACCCGCAGGACCTATCCACGCCCTCCT-3'; SEQ ID NO: 55) which is perfectly 
matched to a hygromycin resistance gene with a point mutation (Hyg-) and 
HYG/(NT)D5Cy3/31G(cnv) (5-Cy™3-AmACCCGCAGGACGTATCCACGCCCTCCT-3'; 
SEQ ID NO: 56) which is perfectly matched to a copy of the hygromycin resistance gene 
which contains a point mutation but which remains functional (Hyg + ). We add this to a 
genomic prep from two yeast strains, one from Mata-intHyg- which contains an integrated 
Hyg- gene. We compare this to a genomic prep of Mata-intHyg* strain which contains an 
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integrated Hyg + gene, We use the following annealing oligonucleotides which are 
composed of LNA residues except for the underlined bases which are DNA: HygLNA15T 
(5-GTGGATAGGTCCTGC-3'; SEQ ID NO: 57) which is perfectly matched to Hyg-, 
Hyg15LNAT(cnv)C (5'-GTGGATACGTCCTGC-3'; SEQ ID NO: 58) which is perfectly 
5 matched to Hyg* and Hyg15LNAT(wt)T (5'-GTGGATATGTCCTGC-3'; SEQ ID NO: 59) 
which is perfectly matched to the wild-type, functional hygromycin resistance gene 
sequence (Hyg(wt)). We add SDS to remove the RecA at 37°C for 30 seconds, and run on 
a 0.7% agarose gel at4°C. 

We observe efficient double D-loop formation in a genomic DNA target, 

1 o indicating that target size and complexity do not limit the reaction. See Figure 17. We also 

observe a readily detectable effect of the LNA sequence on formation efficiency. The 
presence of a mismatch on the annealing strand significantly destabilizes the molecule, as 
in Example 16. Accordingly, it is possible to discern the sequence of the target from a 
genomic DNA prep with high accuracy. 

15 We also analyze metaphase chromosome spreads obtained from 

mammalian cells, including human cells. For example, we combine 0.5 to 0.8 ml whole 
blood with 0.2 ml phytohemaglutinin (PHA; M-form lyophilized from GibcoBRLor Sigma 
dissolved according to the manufacturer's instructions) and mix gently. The cells are added 
to a flask with 1 0 ml of complete cell culture media and we incubate them at 37°C for 72 

20 hours. We add 0.1 ml Actinomycin-D (5 mg/10 ml water) and incubate for 20 minutes. We 
then add 0.1 ml of colcemid (10 pg/ml; GibcoBRL) and incubate for 10 more minutes. We 
centrifuge the cells at 1000 rpm for 8 minutes, aspirate the supernatant and break up the 
cell pellet using a polyethylene pipet. We lyse the cells by adding prewarmed 37°C 
hypotonic solution (75 mM KCI) drop by drop, mixing gently after each addition wth the 

2 5 pipet, until the final volume reaches about 2 ml. We then add a larger amount of hypotonic 

solution to bring the total volume to 10 ml and incubate at 37°C for 15 minutes. We then at 
1 0 drops of fixative solution (3 parts absolute methanol: 1 part glacial acetic acid) and mix 
with the pipet. We then centrifuge the cells at 1000 rpm for 8 minutes, aspirate the 
supernatant and break up the cell pellet using a polyethylene pipet. We add fixative solution 

3 o drop by drop, mixing gently after each addition wth the pipet, until the final volume reaches 

about 2 ml. We then add a larger amount of fixative solution to bring the total volume to 10 
ml. We then use these cells directly or store them in a refrigerator overnight. 
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We make slides with the cells as follows. We pellet the cells 20 minutes 
after the first addition of fixative solution by centrifuging at 1000 rpm for 8 minutes. We 
remove the supernatant using an aspirator, break up the cell pellet with the pipet, and 
resuspend in 10 ml of fixative solution by mixing gently. We repeat the pelleting and 
resuspending steps two more times. After the final resuspension, we leave 0.5 ml to 3.0 ml 
of fixative solution above the cell pellet, mix and drop four to six drops of cell suspension 
onto a clean wet slide. We then allow the slide to either air dry or place it on a hot plate at 
55°C-60°Ctodry. 

We test the ability of the double D-loop formed according to the methods of 
the invention to discriminate single basepair differences in a genomic target sequence. We 
incubate the slides with two RecA-coated incoming oligonucleotides, each specific for a 
target sequence corresponding to the two interrogated alleles and each separately labeled, 
for example with Cy™-3 and Cy™5. We then add two annealing oligonucleotides 
complementary to the incoming oligonucleotides. We determine the sequence of the target 
by destabilizing the mismatched double D-loop by denaturing RecA and detecting the label 
on the oligonucleotide in the resulting double D-loop. 

Alternatively, for detection of a target sequence in the genome, the slides 
are incubated sequentially with incoming and annealing oligonucleotides complementary to 
a desired genomic target. One of these oligonucleotides is labeled with a detectable moiety 
20 which is monitored to detect the formation of a stable double D-loop. 



15 



EXAMPLE 18 



ASSESSMENT OF GENE AMPLIFICATION BY DETECTING DOUBLE D-LOOP 



25 FORMATION 



We test the ability of double D-loop formation to detect multiple copies of a 
gene in a genome. We use two strains of yeast containing one or multiple copies of the 
Hyg(rep) gene. We extract the genomic DNA and form double D-loops using the same 
conditions as described in Example 17, using HygUDF45G as the incoming oligo, and 
HygLNA15T as the annealing oligo. We run the reaction on a 1% agarose gel and quantify 
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the amount of double D-loop formed using Molecular Dynamics ImageQuant™ and a 
Typhoon™ imager. 

We observe that the intensity of double D-loop band increases in proportion 
to copy number of the inserted gene. This indicates that detection of double D-loop 
5 formation can be used to quantify amplification of a target duplex nucleic acid molecule, 
including a target gene such a ERB2 and c-Myc. 



EXAMPLE 19 

1 o SEQUENCE SPECIFIC CLEAVAGE OF NUCLEIC ACID MOLECULES USING DOUBLE 

D-LOOPS 



We test whether the sequence specificity of double D-loop formation can 
be used to direct cleavage of a target nucleic acid molecule at a desired location. We form 

1 5 double D-loop targets by sequential hybridization. We combine oligonucleotides in two 
separate tubes as follows. In one tube we combine 7 pi of a 4 pM solution of 32 P-labeled 
70mer oligonucleotide (OligoA; SEQ ID NO: 1); 1.9 pi of a 13 pM solution of a 25mer 
oligonucleotide (LD25G; 5'-GCTATTCGGCTAGGACTGGGCACAA-3'; SEQ ID NO: 60); 
and 0.75 pi 10x hybridization buffer (100 mM Tris-HCI pH 7.5). In a second tube we 

2 o . combine 1 .92 pi of a 1 2.5 pM solution of another 70mer oligonucleotide (OligoB; SEQ ID 
NO: 2); 1 .46 pi of a 16.4 pM solution of another 25mer oligonucleotide (UD25C; 5- 
TTGTGCCCAGTCCTAGCCGAATAGC-3'; SEQ ID NO: 61); 0.76 pi 10x hybridization buffer; 
and 3.46 pi water. These oligonucleotides are complementary to each other as follows: 
OligoA and OligoB are complementary; LD25G and UD25C are complementary; LD25G is 

2 5 complementary to OligoA such that LD25G hybridizes approximately in the center of OligoA; 

and UD25C is complementary to OligoB such that UD25C hybridizes approximately in the 
center of OligoB. We heat each of the separate tubes to 95°C for 2 minutes and then cool 
to 60°C for 20 minutes. This allows for OligoA/LD25G and OligoB/UD25C duplexes to form 
in the separate tubes. We then mix the tubes and incubate for 5 minutes at 37°C. This 

3 o allows the overhanging ends on OligoA and OligoB to hybridize forming a double D-loop 
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15 



5 



structure. We then cool the samples to 4°C and separate the samples on a non-denaturing 
12% polyacrylamide gel run at 4°C for 2.5 hours at 8 W. 

We detect the location of the double D-loops in the polyacrylamide gel by 
autoradiography and excise the band corresponding to the double D-loop. We incubate the 
polyacrylamide gel slice containing the double D-loops at 4°C overnight in 1 ml of 2 mM 
Mg(acetate) 2 to elute the double D-loops from the gel slice. We transfer 250 pi of the 
solution containing the double D-loops into four different microfuge tubes and add 750 pi 
cold ethanol and 1 pg of poly dl-dC as a DNA carrier. We incubate this sample at 4°C 
overnight and pellet the precipitate double D-loops by centrifugation at in a microcentrifuge 
at 4°C for 30 minutes at 13,500 rpm. We aspirate the supernatant and wash the pellet by 
adding 200 pi 70% ethanol, centrifuging at 4°C for 15 minutes at 1 3,500 rpm and aspirating 
the supernatant. We dissolve the double D-loops in 100 pi of 1x TBM (90 mM Tris-borate; 1 
mM MgCb). We either use the double D-loops immediately or store at -20°C. 

We combine 2 pi of 32P-| a beled double D-loops in a reaction mix with 1 pi 
reaction buffer (300 mM BisTris-HCI pH 7.0; 500 mM KCI; 25 mM MnCI 2 ; 500 pg/ml BSA 
and 10 mM DTT), 0.5 pi MRE1 1 protein purified from Saccaromyces cerevisiae, 20 mM 
ATP and 4.5 pi water. Optionally, we include 0.5 pi RAD50 purified from Saccaromyces 
cerevisiae. If RAD50 is added, we add 4 pi water. We incubate this mixture for 30 minutes 
at 37°C to allow MRE1 1-mediated cleavage of the target. We separate the reaction by 
either a non-denaturing 12% polyacrylamide gel electrophoresis or by denaturing (7M urea) 
20% polyacrylamide gele electrophoresis. 

We observe approximately 60% cleavage of the target nucleic acid 
molecule, i.e. approximately 40% of the ^-labeled 70mer oligonucleotide. The cleavage 
that we observe is highly specific, with about 80% of cleavage occuring at the ends of the 
double D-loop. The localization of the cleavage site to the junction of the double D-loop at 
the 5' end of the incoming and annealing oligonucleotides indicates that it is possible to 
select a specific cleavage site by selecting specific incoming and annealing 
oligonucleotides. Accordingly, it is possible using this method to site-specifically cleave at 
any given base in a nucleic acid target with a defined sequence. 
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EXAMPLE 20 

DOUBLE D-LOOP FORMATION IN MEMBRANE-BOUND NUCLEIC ACID TARGETS 

5 We test the efficiency of double D-loop formation on a target crosslinked to 

a membrane. We crosslink via a Stratalink various concentrations of linear Hyg- plasmid 
onto a Hybond-N+ (Amersham) membrane. We block the membrane by incubating at room 
temperature for 30 minutes with various concentrations of Denhardf s solution (100x: 2% 
BSA, 2% Ficoll, 2% PVP (polyvinylpyrrodilone)). We separately form the RecA filament 

l o using the same conditions as Example 10, with HygUDF45G with 5x the amount of 
reactants. We dilute the reaction to a final volume of 1 mL in 1x Synaptic buffer. 
Subsequently, we add the membrane to the reaction and incubate for 20 minutes at 37°C. 
We then add the 3.7 uL of 27 uM HYGLNA15T, and incubate 10 minutes at 37°C. We 
subsequently wash the membrane in various concentrations of SSC (20x: 3M NaCI, .3M 

1 5 Na 3 Citrate) at elevated temperatures (37-65°C), and visualize on the Molecular dynamics 
Typhoon™ Imager. 

We observe efficient formation of double D-loop in a target crosslinked to a 
membrane. This results demonstrates that the methods of the invention can be used to 
form double D-loops in DNA crosslinked to a solid support such as a membrane, glass slide, 
20 or 96 well plate, with no serious detrimental effects. We can, thus, form a double-D loop 
sequence specifically, and visualize its structure without running a gel. Visualization of the 
• formation of a stable double D-loop with a perfectly matched oligonucleotide as compared to 
the absence of a stable double D-loop structure with a mismatched oligonucleotide allows 
easy visualization of single nucleotide polymorphisms (SNPs). 
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EXAMPLE 21 

SINGLE NUCLEOTIDE MISMATCH DISCRIMINATION IN AN 8 KB PLASMID TARGET 
USING PNA ANNEALING OLIGONUCLEOTIDES 



In this example, we test the ability to discriminate single nucleotide 
differences in an 8 kb nonsupercoiled plasmid target using annealing oligonucleotides 
composed of PNA. 



10 Materials and Methods 
Target plasmids : 

mt ("mutant") : pAURHyg (rep) eGFP 

15 

WT ("wild type"): pAURHyg (wt) eGFP 

The plasmids are each 7982 bp, and differ in a single nucleotide. As a 
consequence of plasmid purification, a subset of the plasmids are no longer supercoiled, 
2 o having been nicked or otherwise broken. Upon gel electrophoresis, a single band having 
various nonsupercoiled topoisomers can be observed. 



Incoming oligonucleotides: 

2 5 Incoming oligonucleotides are HPLC purified, and have the following 

sequences. 



mt: HYG(NT)D5Cy5/31C(rep) 

(5 ' -Cy5-atttacccgcaggacctatccacgccctcct-3 ' ) 
WT: HYG(NT)D5Cy3/31A(wt) 

(5' -Cy3-atttacccgcaggacatatccacgccctcct-3 ' ) 



35 Annealing oligonucleotides: 
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Annealing oligonucleotides are composed of PNA, with an acetyl group 
("Ac") to block a free reactive primary amine. The annealing oligonucleotides are used as 
received from Applied Biosystems, Inc. (crude prep). 

5 mt : HYG (T) PNA/15G (Ac-gtggataggtcctgc) 

WT: HYG (T) PNA/15T (Ac-gtggatatgtcctgc) 

10 Protocol : 

1. Prepare reaction mix (add RecA last) as' follows: 

15 1.1 uL incoming oligo mix (18jtzM) 

0.55 pL of 18 yM Hyg (nt) D5Cy5/31C (rep) 
(Cy-5 labeled) 

0.55 ixh of 18 pM Hyg(nt)D5Cy3/31A(wt) 
(Cy-3 labeled) 
2 0 3 uL 5X Synaptic Buffer 

1.5 10 mM ATPyS 

7.4 ixh dH20 

2 jxL RecA (10.9 uM) 
15 \xL 

25 

2. Incubate for 10 minutes at 37°C 

3. Add the following: 

30 2.5 uL (500ng) nonsupercoiled target 

Hyg (rep) or Hyg(wt) target 

2.5 uL Mg(OAc) 2 (74 mM) 
20 JUL 

35 4. Incubate for 2 0 minutes at 37°C 

5. Add one or the other annealing oligo (at 36.5 /zM) 

.50 pL HYG(T) PNA/15G or 
40 .50 jiL HYG (T) PNA/15T 

6. Incubate for 5 minutes at 37°C 



7. 

45 



Stop reaction with 2 pL 10% SDS and incubate 
respectively under the following conditions: 
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15 



1. 
2. 
3. 
4. 
5. 

6. 
7. 
8. 
9. 

10. 



45°C 



Hyg (rep) 



37°C 0 min 
1 min 
2 . 5 min 
5 min 
10 min 



0 min 

1 min 

2 . 5 min 
5 min 
10 min 



11. 
12. 
13. 
14. 
15. 

16. 
17. 
18. 
19. 
20. 



Hyg(wt) 



37°C 



45°C 



0 min 

1 min 

2 . 5 min 
5 min 
10 min 

0 min 

1 min 

2 . 5 min 
5 min 
10 min 



20 



35 



After the indicated incubation time, place samples 
on dry ice. 



8. Run gel: 



Add 10 uL of 10X loading dye (w/ 25% ficoll) . Run 
on a 1X TBE ' agarose gel without EtBr at 4°C 
25 Run gels at 120V for about 2 hours. 

9- Scan on Typhoon Imager 

30 10 ' ?ffi)" 0nal POSt " Stainin 9 (not sh °™ in PIGS. ISA and 

Stain with IX SYBR green and scan on Typhoon 
imager 



Results 



The target in these experiments is one of two nonsupercoiled plasmids of 
about 8 kb, differing in a single nucleotide. The incoming oligonucleotides are 31-mer 
deoxyribonucleotides labeled at the 5 1 terminus with either Cy5 or Cy3; the annealing 
4 o oligonucleotides are 15-mer PNA oligonucleotides acetylated at the N terminus. 

Deproteinization is performed by adding SDS at either 37°C or 45°C to aliquots of the 
reaction mixture and incubating at the indicated temperature for various amounts of time, 
after which time the aliquots are removed and placed on dry ice. 
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Two gels are run separately, but scanned at the same time to allow a direct 
comparison. Results are shown in FIGS. 18A and 18B for the 37°C deproteinization data 
only. 

Despite the small size of the incoming and annealing oligonucleotides and 
5 the large target (8 kb), results shown in FIG. 18A (wt target pAURHyg(wt)eGFP) and FIG. 
18B (mt target pAURHyg(rep)eGFP) demonstrate that the single nucleotide difference in the 
target query region can readily be distinguished, with stable double D loops (stable to 
deproteinization and subsequent electrophoresis) observed only when the annealing 
oligonucleotide and incoming oligonucleotide exactly match the target. 

1 o Furthermore, discrimination among target variants differing by a single 

nucleotide is achieved in the presence of a mixture of differentially labeled incoming 
oligonucleotides, demonstrating the ability to multiplex these assays. 

In additionally, the gels demonstrate that the targets having stable double D 
loops are readily separated from those without, making possible the isolation of desired 
15 allelic variants. 

Although lanes 6 and 7 of FIG, 18B suggest that stable double D loops are 
formed on the mutant (mt) target with a mismatched annealing oligonucleotide (wt PNA), 
these data are not representative, and may be due to the bright background in this part of 
the gel, which may artificially enhance the signals to detectable levels; most of our gels 

2 o show no formation of stable double D loops unless all oligos (incoming and annealing) are 

perfectly matched to target. 

In data that are not shown, the stringent requirement for both a perfectly 
matched incoming and perfectly matched annealing oligonucleotide to effect stable D loop 
formation is not observed with supercoiled plasmids. 

2 5 For example, in the absence of a perfectly matched annealing 

oligonucleotide, we observe that perfectly matched incoming oligonucleotides can form 
stable single D loops, whereas incoming oligonucleotides mismatched by a single 
nucleotide do not. In the presence of a perfectly matched annealing oligonucleotide, stable 
double D loop formation is observed even when the incoming oligonucleotide is mismatched 

30 to target by one nucleotide. In this latter case, discrimination of allelic targets is enhanced 
when the label is on the annealing oligonucleotide. 
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EXAMPLE 22 

SINGLE NUCLEOTIDE MISMATCH DISCRIMINATION IN AN 8 KB PLASMID TARGET 
USING LNA ANNEALING OLIGONUCLEOTIDES 

5 

In this example, we detect a single nucleotide mismatch in an 8 kb plasmid 
target using LNA annealing oligonucleotides. 



Materials and Methods 
Target plasmids : 

Targets are the same as in Example 21: 8 kb 
nonsupercoiled plasmids differing in a sinq] 
nucleotide. ^ 

mt : pAURHyg ( rep ) eGFP 

WT: pAURHyg (wt ) eGFP 

Incoming oligonucleotides : 

Incoming oligonucleotides are HPLC purified, 
mt : HYG (NT) D5Cy5/3 1C (rep) 

(5' -Cy5-atttacccgcaggacctatccacgccctcct-3M 
(SEQ ID NO: 55) 

WT : HYG (NT) D5Cy3 / 3 1A ( wt ) 

(5 ' Cy3-atttacccgcaggacatatccacgccctcct-3 ' ) 
(SEQ ID NO: 62) 



Annealing olicros : 

Annealing oligos containing LNA residues, 
synthesized by Proligo, are used as a crude 
prep. In the sequences listed below, LNA 
residues are prefixed by a » + « sign and 
deoxyribonucleotide residues are prefixed by 
a lower case "d" . 
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mt: HYGLNA15T 

(5 # -+G+TdG+G+A+TdA+G+G+TdC+C+T+G+C-3 ' -3 ' ) 
[SEQ ID NO: 57] 

5 WT: HYG (T) LNA15T (wt) 

(5 ' -+G+TdG+G+A+TdA+T+G+TdC+C+T+G+C-3 ' ) 
[SEQ ID NO: 63] 



Protocol : 

10 



Reactions are essentially as set forth in 
Example 21. 



15 Results 

FIG. 19 shows data from reactions stopped by addition of SDS at 37°C 
followed by immediate freezing on dry ice; FIG. 20 shows data from reactions stopped by 
addition of SDS at 37°C followed by 2.5 minute and 5 minute incubations at 37°C, as 
2 o indicated, prior to freezing on dry ice. 

As in Example 21 , results shown in FIGS. 19 and 20 demonstrate that 
stable double D loops are formed only when both annealing and incoming oligonucleotides 
are perfectly matched to target. In contrast to Example 21 , the annealing oligonucleotide 
comprises LNA residues, rather than PNA residues. 

2 5 The data further demonstrate that single nucleotide variants of this 8 kb 

nonsupercoiled plasmid can be distinguished when both the incoming and annealing 
oligonucleotides are present in admixture, demonstrating the robust multiplexing capabilities 
of this approach. 
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EXAMPLE 23 

SINGLE NUCLEOTIDE MISMATCH DISCRIMINATION IN A LINEAR TARGET WITHOUT 
COMPETING OLIGONUCLEOTIDES 

5 

The preceding two examples demonstrate that our assay provides 
excellent discrimination between two 8 kb plasmid targets differing by a single nucleotide, 
under conditions in which incoming oligonucleotides and/or annealing oligonucleotides that 
are perfectly matched to target are present in admixture with oligonucleotides that are not 
l o perfectly matched to target. In this example, we test whether similar selectivity (mismatch 
discrimination) is observed if no competing oligonucleotides are present. 

In addition, the preceding two examples do not distinguish among various 
nonsupercoiled topoisomers of the 8 kb plasmid (linear, nicked, covalent closed circular). In 
this example, we test whether equivalent selectivity is observed in a linear target as large as 
8 kb. In these experiments, therefore, we use one target, 8kb EcoRV-linearized 
pAURHyg(rep)eGFP plasmid, and the indicated combinations of oligonucleotides. 
Materials and Methods 
Target : 

8kb EcoRV-linearized pAURHyg (rep) eGFP 
Incoming oligos : 

HPLC purified 
mt: HYG(NT)D5Cy5/31C(rep) 

(5' -Cy5-atttacccgcaggacctatccacgccctcct-3' ) 
[SEQ ID NO: 55] 

WT: HYG(NT)D5Cy3/31A(wt) 

(5' -Cy3-atttacccgcaggacatatccacgccctcct-3' ) 
[SEQ ID NO: 62] 

Annealing oligos : 

Synthesized by Proligo, used as a crude prep. 
LNA residues are prefixed by "+"; 
deoxyribonucleo tides are prefixed by "d". 
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•mt: HYGLNA15T 
(5' -+G+TdG+G+A+TdA+G+G+TdC+C+T+G+C-3' ) 
[SEQ ID NO: 57] 

WT: HYG (T) LNA15T (wt ) 

(5 ' -+G+TdG+G+A+TdA+T+G+TdC+C+T+G+C-3 ' ) 
[SEQ ID NO: 63] 



10 Protocol : 

1. Prepare reaction mix 

1.1 jiL Incoming oligo (18^M) 
2 |UL lOx Syn Comp Buffer* 
15 12.74 nL dH20 

1.16 iih RecA (10.9 fM) (added last) 
17 ^iL total 

2. Incubate for 10 minutes at 3 7°C 

20 

3. Add the following: 

2 \xh Linear Hyg(rep) (250 ng/\xl) 
19 jxL total 

25 

4. Incubate for 10 minutes at 37°C 

5. Add 1 (iL of annealing oligo (38.5 MM) 

3 0 6. Incubate for 10 minutes at 37°C 

7. Stop reaction with 2 \ib 10%SDS, put on ice or dry 
ice . 

35 8 . Run gel 



Add 10 jiL of 10X loading dye (w/ 25% ficoll) 
Run on a 1XTBE, 1% agarose gel without EtBr 



40 * lOx Syn Comp Buffer: 

250mM Tris-OAc pH7.15 
50mM Mg(OAc) 2 
lOmM DTT 
lOmM ATP7S 
45 5X Denhardts 

(lg f 10011400, Ig PVP and lg 
BSA Fraction V in 50ml H 2 0) 
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Results 

Results are shown in FIG. 21 : times represent duration of incubation at 
5 37°C following addition of SDS and before chilling. 

As can readily be seen, double D loops are seen when both annealing and 
incoming oligonucleotides are perfectly matched to the target area of the 8 kb linear target. 

EXAMPLE 24 

0 LABELING OF ANNEALING OLIGONUCLEOTIDE 

In the preceding three examples (Examples 21, 22, and 23), only the 
incoming oligonucleotides are labeled. In this example, we compare labeling of incoming 
oligonucleotides with labeling of annealing oligonucleotides using an 8 kb linearized plasmid 
5 target. All oligonucleotides are perfectly matched to target. 



Materials and Methods 

Target : 

Linear pAURHyg(rep)eGFP 

Incoming oligonucleotides ; 

Incoming oligonucleotides are HPLC purified 
and have the following sequences: 

HYG/UD/45G 

5 ' -cgcagctatttacccgcaggacctatccacgccctccta 
catcga-3' 
[SEQ ID NO: 64] 

HYG/UDF/45G 
5'-Cy5- 

cgcagctatttacccgcaggacctatccacgccctcctaca 
tcga-3' 

[SEQ ID NO: 65] 
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Annealing oligonucleotides : 



5 



LNA- containing annealing oligonucleotides are 
used as a crude prep, as synthesized by 
Proligo, LNA residues are prefixed by " + " , 
deoxyribonucleotides are prefixed by "d". 



10 



HYGLNA15T 

5 < -+G+TdG+G+A+TdA+G+G+TdC+C+T+G+C-3 » 
[SEQ ID NO: 57] 



15 



HYG(T) /lSGrCB 

(5 1 -/Cy5/+G+TdG+G+A+TdA+G+G+TdC+C+T+G+C- 
/BioTEG/3 1 ) 
[SEQ ID NO: 66] 



Results 



FIG. 22 demonstrates that either the incoming or annealing oligo 



2 o nucleotides may be labeled. Under the indicated conditions, signal is stronger when the 

annealing oligonucleotide, rather than incoming oligonucleotide, is labeled. Not shown, both 
may be labeled. 



In this experiment we test for the optimal temperature for forming dDloops 
in linear duplex DNA. 

We use reaction conditions and oligonucleotides as in Example 24 (FIG. 
3 o 22), except that incubation temperatures are varied. The reactions are incubated for each 
step of complex formation at 8 temperatures: 22, 27, 32, 37, 45, 50, 55, and 60 degrees 
Celsius. 

The oligonucleotides are Cy5 labeled and unlabeled 45mers as the 
incoming oligonucleotide, and Cy5 labeled and unlabeled LNA 15mers as the annealing 
3 5 oligonucleotide. The final concentration of the reaction is 7.5 \M RecA, 1 jxM incoming 



EXAMPLE 25 



25 



TEMPERATURE EXPERIMENTS 
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Oligo, 1x synaptic comp buffer, 4.7nM target (linearized pAurHyg(rep)eGFP plasmid), 1.825 
|iM annealing oligo. 

In the first reaction of each temperature, Cy5 labeled incoming is used 
alone to show the need for the annealing oligo. In the second reaction, Cy5 labeled 
5 incoming is used with an unlabelled 15mer LNA. In the third, unlabelled incoming is used 
with a labeled Cy5 annealing. And in the fourth, the Cy5 labeled annealing is used without 
an incoming to show the need for the RecA filament. 

Results are shown in FIG. 23. As can be seen, minimal formation of 
double D-loops (indicated by arrows) is seen with incubations below about 37°C, with 

1 o increasing formation from 37°C to about 55°C. At 65°C, formation is reduced. 

Thus, optimal temperature for double D-loop formation on this target is 50 - 
55°C using a 45-mer incoming oligonucleotide and 15-mer LNA containing annealing 
oligonucleotide. 

15 EXAMPLE 26 

MIXED ANNEALING OLIGONUCLEOTIDES 

In this experiment, we test the effect of using mixed incoming 
oligonucleotides - one perfectly matched to the target, one mismatched to target - with 

2 o mixed annealing oligonucleotides. Label is on the annealing oligonucleotides. 

Results are shown in FIG. 24. 

Comparing lanes 2 and 3, addition of a Cy5-labeled mismatched annealing 
oligonucleotide does not adversely affect the selective formation of a double D-loop at the 
Mt target. Analogously comparing lanes 6 and 4, addition of the Mt annealing 
2 5 oligonucleotide does not reduce selective formation of a double D-loop at the CN V target. 

In fact, we observe that mismatch discrimination directed by mixed labeled 
annealing oligos is superior to mismatch discrimination using labeled incoming oligos: we 
get enhanced sensitivity of the assay with reduced background; clean mismatch 
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discrimination is observed regardless of the topology of the DNA target, with linear and 
supercoiled DNA targets showing clean mismatched discrimination with no mixed signals. 

EXAMPLE 27 

5 LENGTH DEPENDENCE OF MISMATCH DISCRIMINATION 

In this example, experiments are performed to assess the dependence of 
mismatch discrimination on the length of incoming and annealing oligonucleotides. 

In each experiment, 1.1 ^l of 18 ^M mixed incoming oligonucleotides 
l o (exactly matching mutant ("mt") and "converted" ("cnv") targets, which differ by a single 

nucleotide) and 1 pi of 36.5 ^M mixed annealing oligonucleotides (exactly matching mutant 
and "converted" targets, which differ by a single nucleotide) are used. Incoming 
oligonucleotides are labeled with Cy3 (mt) and Cy5 (cnv). Annealing oligonucleotides are 
unlabeled and are composed of 2'OMe residues (indicated by "f prefix). Sequences are 
15 listed below. 

About 500 ng of linear 8 kb pAURHygeGFP plasmid and 250 ng of 
"converted" plasmid is used separately as target DNA. 

Reaction conditions for forming double D-loops are essentially as set forth 
in Example 21, above, with incubations at 37°C. 

20 

Sequences 

2'-OMe residues are prefixed with "m". 
Incoming : 

25 

23-mers : 

HYG (NT) D5Cy5/23C (rep) 
30 5' -Cy5-acccgcaggacctatccacgccc-3' 

[SEQ ID NO: 67] 
lOOnM 
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HPLC purification 

HYG/ (NT)D5Cy3/23G(cnv) 

5 ' -Cy3 -acccgcaggacgtatccacgccc-3 ' 

[SEQ ID NO: 68] 

lOOnM 

HPLC purification 



10 27-mers: 

HYG (NT) D5Cy5/27C (rep) 

5' -Cy5-ttacccgcaggacctatccacgccctc-3' 
[SEQ ID NO: 69] 
15 lOOnM 

HPLC purification 

HYG/ (NT) D5Cy3/27G (cnv) 

5' -Cy3-ttacccgcaggacgtatccacgccctc-3 ' 
20 [SEQ ID NO: 70] 

lOOnM 

HPLC purification 



25 35-mers: 

HYG (NT) D5Cy5/35C (rep) 

5' -Cy5-ctatttacccgcaggacctatccacgccctcctac-3' 
[SEQ ID NO: 71] 

30 lOOnM 

HPLC purification 

HYG/ (NT) D5Cy3/3 5G (cnv) 

5 ' " c y 3 "* ctattta cccgcaggacgtatccacgccctcctac-3 / 
35 [SEQ ID NO: 72] 

lOOnM 

HPLC purification 
39-mers : 

40 

HYG(NT)D5Cy5/39C(rep) 

5' -Cy5-agctatttacccgcaggacctatccacgccctcctacat-3 ' 
[SEQ ID NO: 73] 

lOOnM 

45 HPLC purification 

HYG/ (NT)D5Cy3/39G(cnv) 

5' -Cy3-agctatttacccgcaggacgtatccacgccctcctacat-3' 
[SEQ ID NO: 74] 
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lOOnM 

HPLC purification 
43-mers : 

5 

HYG (NT) D5Cy5/43C (rep) 
5'-Cy5- 

gcagctatttacccgcaggacctatccacgccctcctacatcg-3' 
[SEQ ID NO: 75] 
10 lOOnM 

HPLC purification 

HYG/ (NT) D5Cy3/43G (cnv) 
5' -Cy3- 

15 gcagctatttacccgcaggacgtatccacgccctcctacatcg-3 ' 

[SEQ ID NO: 76] 
lOOnM 

HPLC purification 

20 

Annealing : 

9-mers : 

HYG (T) R20Me/9G (rep) 
5 ' -mGmAmUmAmGmGmUmCmC-3 ' 
[SEQ ID NO: 77] 
lOOnM 

HPLC purification 

HYG (T) R20Me/9C (cnv) 
5' - mGmAmUmAmCmGmUmCmC 
[SEQ ID NO: 78] 
lOOnM 

HPLC purification 

13-mers 

HYG (T) R20Me/13G (rep) 
40 5 ' -mUmGmGmAmUmAmGmGmUmCmCmUmG- 3 ' 

[SEQ ID NO: 79] 
lOOnM 

HPLC purification 

45 HYG (T) R20Me/13C (cnv) 

5 ' -mUmGmGmAmUmAmCmGmUmCmCmUmG-3 ' 

[SEQ ID NO: 80] 

lOOnM 

HPLC purification 



25 



30 



35 
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17-mers 

HYG (T) R20Me/l7G (rep) 

5' ~mCmGmUmGmGmAmUmAmGmGmUmCmCmUmGmCmG-3 ' 

[SEQ ID NO: 81] 

lOOnM 

HPLC purification 
HYG (T) R20Me/l7C (cnv) 

5 ' -mCmGmUmGmGmAmUmAmCmGmUmCmCmUmGmCmG-3 ' 

[SEQ ID NO: '82] 

lOOnM 

HPLC purification 



21-rners 

20 HYG (T) R20Me/21G (rep) 

5 ' -mGmGmCmGmUmGmGtnAmUrrAmGmGTnUmCmCmUmGmCmGmGmG-3 ' 
[SEQ ID NO: 83] 

lOOnM 

HPLC purification 

25 

HYG(T)R20Me/21C(cnv) 

5 ' -mGmGmCmGmUmGmGmAmUmAmCmGmUmCmCmUmGmCmGmGmG-3 ' 
[SEQ ID NO: 84] 

lOOnM 

3 0 HPLC purification 

25-mers 

35 HYG (T) R20Me/25G (rep) 

5 ' - 

mAmGmGmGmCmGmUmGmGmAmUmAmGmGmUmCm 

[SEQ ID NO: 85] 
40 lOOnM 

HPLC purification 

HYG (T) R20Me/25C (cnv) 
5'- 

45 ^™ GmGmGm ^^ 

[SEQ ID NO: 86] 
lOOnM 

HPLC purification 
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Results 

Results are shown in FIG. 25, and demonstrate that the ability to 
5 discriminate mismatches is strongly dependent upon the size of both the incoming and 
annealing oligonucleotides, providing a narrow range for mismatch discrimination. 

Under the conditions tested, the annealing oligonucleotide must be more 
than 13 nt in length to achieve dDloop formation, and can be as large as 25 nt, The 
incoming oligonucleotide must be at least 23 nt in length and can be as large as 35 nt for 
l o precise genotyping, with incoming oligonucleotides of 39 nt or larger giving a mixed signal, 
losing mismatch fidelity. 

Under the conditions tested, a 27-mer incoming oligonucleotide and 17-mer 
annealing oligonucleotide are optimal, both for yield of double D-loops as well as fidelity of 
mismatch discrimination, 

15 

EXAMPLE 28 
SENSITIVITY 



Experiments are performed to determine the target detection sensitivity of 

20 the methods. 

In a first series of experiments, dDloops are formed on an 8 kb linearized 
plasmid using 32 P-radiolabeled oligonucleotides. In a second series of experiments, 
dDloops are formed on the identical target using fluorescently labeled oligonucleotides. 

FIG. 26 demonstrates that, using a fluorescent label, an 8 kb linear plasmid 
2 5 target can be detected down to 1 .0 ng. 
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Materials and Methods 

Radioactive dDLoop Synthesis 

5 A 1 5-minute presynapsis step is performed at 37°C using 40 nM ^P- 

HygUD45G (NT) incoming oligonucleotide and 1.5 nM RecA in a solution containing 1 mM 
ATP Y S, 25 mM Tris-OAc (pH 7.5), 1 mM Mg(OAc) 2l and 1 mM DTT. The synapsis phase is 
begun with the addition of double stranded target and an additional 9 mM Mg(OAc) 2 for 5 
minutes. 

1 0 The amount of double stranded target is varied. When using 

pAURHyg(rep) supercoiled plasmid, the final concentration of the target in solution is 10 nM. 
Linear pAURHyg(rep) target concentration is 0.73 ng. Sacchammyces cerevisiae 
replacement genomic DNA is 3.65 nM. Sacharomyces cerevisae replacement genomic 
DNA is 7.3 jig/p.1. HT1080 human genomic DNA is 7.3 ng/nl. 

1 5 Double displacement loops (dD-loops) are formed with the addition of 

required 320 nM 32P-15mer O-methyl RNA annealing oligonucleotide (to T strand), followed 
by an additional incubation at 37°C occurred for 10 minutes. After stopping the reaction in 
dry ice, the joint molecules are deproteinated by the addition of 1% SDS and associated 
protein removed by adding KCI (4°C) to a 100 mM concentration and spinning the solution 

20 at 8,000 rpm for 5 min at 4°C. 

The presence of joint molecules is confirmed by 1% agarose gel 
electrophoresis (1 X Tris-borate-EDTA, 97 V, 1 .5 h). Gels are stained in EtBr for detection of 
double stranded target before being dried on Whatman DE81 filter paper using a Gel Dryer 
Vacuum System (FisherBioTech) for 2.5 h at 80°C and visualized by using a Typhoon 8600 
2 5 Phosphorlmager (Molecular Dynamics Inc., CA). 

Quantification of joint complexes is done using ImageQuant 5.2 software. 
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EXAMPLE 29 
DETECTION OF PGR PRODUCT 



This Example demonstrates sequence-specific detection of a clinically 
5 relevant gene target present within a PCR amplification product. The PGR product is 
purified from the amplification template. 

The Example further demonstrates the ability to detect double D loops 
formed using oligonucleotides that have biotin as capture moiety. 



10 Target and Oligonucleotides 



Wildtype p-tubulin sequence: 

CCCGCCCCGCGGCCTGAAGATGTCGGCCACCTTCATCGGCAACAGCACGGC 
[SEQ ID NO: 87] 

15 

jffTUBwt/31D/5'Bio (incoming oligonucleotide) (has 5 1 
biotin 

5' -GGCCTGAAGATGTCGGCCACCTTCATCGGCA-3 ' 
[SEQ ID NO: 8 8] 

20 

j3TUBwt/l5L/5Cy5 (annealing oligonucleotide) (LNA) 
5 ' - AAGGTGGCCGACATC - 3 ' 
[SEQ ID NO: 89 ] 

25 Results 

Results are shown in FIGS. 27A and 27B. 

FIG. 27A shows that double D loops are detectable in the PCR product 
only when RecA and both incoming and annealing oligonucleotides are present, FIG. 27B 
30 is the same gel, post-stained with ethidium bromide, showing approximately equal amounts 
of target in each reaction. 
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EXAMPLE 30 
CONTEXT INDEPENDENCE OF DETECTION 

Specific alleles of genes relevant to human disease are discriminably 
5 detected in PCR amplification products using identical conditions, indicating that allelic 
discrimination is possible independently of the genomic context of the polymorphism. The 
genes are k-RAS, p53, and beta globin. Annealing oligonucleotides with PNA (designated 
by IP in the oligonucleotide name) and LNA (designated by II in the oligonucleotide name) 
are each used effectively. 

1 0 The oligonucleotides used are the following: 



P5 3 c d2 4 1 wt /D / Cy 5 
[SEQ ID NO: 90] 

15 5' -/5Cy5/atgccgcccatgcaggaactgttacacatgt/3BioTEG/-3' 

P53cd241mt/D/Cy3 
[SEQ ID NO: 91] 

2 0 5 ' " ^ 5Cy3 ^ at 9CCgcccatgcagaaactgt t acacatgt / 3 BioTEG /-3' 

P53cd241wt/P 
[SEQ ID NO: 92] 
5' -aacagttcctgcatg-3' 

25 P53cd241mt/P 

[SEQ ID NO: 93] 

5' -aacagtttctgcatg-3' 

P53cd241wt/L 

3 0 [SEQ ID NO: 94] 

5' -aacagttcctgcatg-3 ' 

P53cd241mt/L 
[SEQ ID NO: 95] 
35 5' -aacagtttctgcatg-3' 



K-ras separation/diagnostic oligos 

40 

Krascdl3wt/D/Cy5 
[SEQ ID NO: 96] 
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5' ^5Cy5/ggcactcttgcctacgccaccagctccaact/3BioTEG/-3' 
Krascdl3mt/D/Cy3 

5' -/5Cy3/ggcactcttgcctacgtcaccagctccaact/3BioTEG/-3' 
5 [SEQ ID NO: 97] 

Krascdl3wt/P 

5' -gctggtggcgtaggc-3' 

[SEQ ID NO: 98] 

10 

Krascdl3mt/P 

5'- gctggtgacgtaggc -3' 

[SEQ ID NO: 99] 

15 Krascdl3wt/L 

5' -gctggtggcgtaggc-3' 
[SEQ ID NO: 100] 

Krascdl3mt/L 
20 5'- gctggtgacgtaggc -3' 
[SEQ ID NO: 101] 



25 jS Globin separation/diagnostic oligos 



BGloA/D/Cy5 

5 ' - / 5Cy5/acggcagacttctcct caggagtcaggtgcaV 3BioTEG/ - 3 ' 
30 [SEQ ID NO: 102] 

BGloS/D/Cy3 

5' -/5CY/acggcagacttctccacaggagtcaggtgca/3BioTEG/-3' 
[SEQ ID NO: 103] 

35 

BGloAwt/P 

5' -actcctgaggagaag-3 ' 
[SEQ ID NO: 104] 

40 BGloSmt/P 

5' -actcctgtggagaag-3 ' 
[SEQ ID NO: 105] 

BGloAwt/L 
45 5' -actcctgaggagaag-3' 
[SEQ ID NO: 106] 

BGloSmt/L 

5' -actcctgtggagaag-3' 
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[SEQ ID NO: 107] 



EXAMPLE 31 

GENE-SPECIFIC SEPARATION AND PURIFICATION 

Experiments are performed to assess the ability to separate and purify 
plasmid targets from inhomogeneous mixture. Results are tabulated in FIG. 28. 

Two unrelated plasmids - one resistant to kanamycin ("Kan"), one resistant 
to tetracycline ("let") - are mixed in the ratios shown in the first column of FIG. 28. 

Aliquots of each plasmid mixture are separately transformed into 
competent £ cof and aliquots of each pool of transformed bacteria plated on media 
containingeitherkanamycinortetracyline, and resulting colonies counted. Results are 
enumerated in columns 2 and 3 ("Colonies (PreSep)"). 

Another aliquot of each plasmid mixture is relaxed using topoisomerase I 
(In other experiments, not shown, T4 Gene 2 is used to relax the plasmid targets with 
similar results.) The mixture of relaxed closed circular plasmids are then subjected to 
double D-loop based separation and purification using a 31-mer incoming oligonucleotide 
b,obnylated at its 3' terminus and a 20-mer PNA annealing oligonucleofide, both specific for 
a porta of the kanamycin resistance gene. The post-separation mixture is transformed 
<nto E. coli, and aliquots plated on media containing either kanamycin or tetracycline. 
Results are enumerated in columns 4 and 5 ("Colonies (PostSep)"). 

Oligonucleotide sequences are set forth below. The separation protocols 
are similar to those used in Example 15, with differences highlighted below. 

Incoming DMA Oliao ; 
KAN/LDB/31t 

tsEnrN;ri^r at9act999cacaaca9/3BioTEG/ - 3 ' 

lOOnM 

HPLC purification 
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Annealing PNA Oligo: 
KAN/LP/20A 

Ac - gtgccc agt cat agccgaat 
5 [SEQ ID NO: 109] 



10 



40 



Separation Protocol: 



1. Preform DNA oligonucleotide RecA filament. 



Mix: 

15 Incoming oligo (18uM) 2.2 ul 
10X Syn Complete Buffer 4.0 ul 
RecA (125.5 uM) 4 . 0 ul 
H 2 0 23.8 ul 

20 Incubate 37°C, 10 min 

Add: 

2 \xl topoisomerase I relaxed plasmid mixture 
25 (250 ng/jil) 

Incubate 37°C, 20 min 

2. Add: PNA Annealing oligo kan/LP/2 0A (36.2 uM) 

30 

Incubate 3 7°C, 2 min 

3. Stop with 2 ul SDS (10%) 
35 Incubate RT, 1 min 

4. Add: 4 \il 1M KCl, spin 5 min 4C @ 8k rpm. 

5. Add: 10 ^il prepared Dynabeads 280. 
To prepare beads: 



a) Gently resuspend beads, add to 1.5ml 

microfuge tube (10 ul/rxn) . 
45 b) Place tubes on magnet 1 min, remove 

supernatant . 

c) Remove tube from magnet, resuspend in IX 

Syn buffer. 
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10 



15 



20 



25 



30 



e) Repeat steps »c" and «d« 2-3 times. 

6 ' liiTTolltlT^ in microfuge tube @ 4 ° c for 1 hr 

? ' s^aLnt.^^ @ ^ 1 min ' ~™ 

8. Wash 2-3 times with 50 |U 1 X Syn Buffer. 

(Resuspend, place on magnet 1 mi n take off 
supernatant, repeat) ' taKe off 

9 Elute bound plasmid from bead by adding 20 ul 
H 2 0 and heating to 65°C 15 min 

^°^ El r trOPOrate 5 ^ int ° DH10B cell s and plate 

Staa^i SSparatel Y on kanamycin-containing and 
tetracycline -containing media 

Results 

As can be seen from columns 4 and 5 of the table set forth in FIG 28 
biotinylated double D-loops formed using a 31-mer incoming oligonucleotide biotinylated at 
its 3' terminus and a 20-mer PNA annealing oligonucleotide, both specific for a portion of 
the kanamycin resistance gene, allow separation and purification of kan-resistant plasmids 
from an inhomogeneous mixture. Kan*plasmids can be isolated, free from tef 
contamination, even when present at an initial molar ratio of 1 :ip (kan-Ztef). Additional 
experiments (not shown) demonstrate isolation without contamination even when the initial 
molar ratio is 1:106. The data further demonstrate that purification is effected without 
destroying the ability of the plasmids to replicate. 



EXAMPLE 39 

ALLELE-SPECIFIC SEPARATION AND PURIFICATION 

In this experiment, five different related plasmid targets are used, as shown 
in column 1 of the table set forth in FIG. 29. 
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pKAN* is a kanamycin-resistant plasmid having the wild-type kanamycin 

resistance gene. 

PKAN- is identical to pKAN + except for a point mutation that converts the 
wild-type codon "tat" to "tag", inactivating the kanamycin resistance gene. 

5 Each of CN V B1 , CN V D3, and CNV D5 is a pool of plasmids in which the 

mutant "tag" has been targeted, in three separate experiments, for alteration to "tac". "TAC" 
at the relevant codon restores the wild-type phenotype (kanamycin resistance), but with a 
detectably altered genotype, ("tac" rather than "tat"). Targeted alteration is performed using 
modified oligonucleotides according to procedures set forth in U.S. patent application serial 
10 no. 09/818,875, filed March 27, 2001, and in WO 01/73002, the disclosures of which are 
incorporated herein by reference in their entireties. 

All plasmids are ampicillin resistant 

Column 2 shows the percentage of "tac tt -containing ("Kan CNV) plasmids 
within each sample, as determined using the Applied Biosystems SnaPshot™ primer 
1 5 extension genotyping system. 

As expected, neither the pure wild type pKAN + nor the pure pKAN- samples 
contains any plasmids with the "converted" tac codon. In contrast, each of the pools 
targeted for gene alteration contains converted plasmids, with pool B1 having 5%, pool D3 
having 69%, and pool D5 having 10% "tac" (Kan CNV) plasmids, as reported by the 
20 Snapshot system. 

As an independent measure of the percentage of Kan + plasmids, an aliquot 
of each sample is transformed into E. coli and aliquots of the transformed bacteria plated 
separately on kanamycin-containing and ampicillin-containing media, with results as shown 
in columns 3, 4 and 5 ("Colonies (presep)"). 

25 As expected, nearly all plasmids in the pKAN + sample are kan R ; 

conversely, no kan R plasmids are seen among pKAN- transformants. Among the gene- 
altered pools of plasmids, the percentage kan* plasmids is similar but nonidentical to the 
percentages suggested by SnaPshot™ genotyping . 

An aliquot of each of the five samples is then subjected to double D-loop 
3 o based separation and purification of "tac"-containing plasmids, using incoming and 



WO 03/027640 



123 



PCT/US02/31073 



annealing oligonucleotides that are perfectly complementary to the "tac"-containing target 
region. 

Results shown in columns 6, 7 and 8 show that purification of tetR plasmids 
is effected in the pKAN + and all three pools of converted plasmids, but not from the pKAN- 
5 sample. The presence of kanR plasmids in the aliquot purified from the pKAN* sample 
demonstrate that the double D-loops form in this supercoiled plasmid without perfect 
discrimination between the wild-type "tat" sequence and converted "tac" sequence. 

Column 9 gives summary percentages from an experiment in which the 
plasmids are relaxed prior to separation. In contrast to the results with supercoiled 

1 o plasmids, no kan* plasmids are isolated from the pKAN* wild type ("tat") sample. And 

>90% of plasmids obtained from the gene-altered pools of plasmids (B1 , D3, and D5) are 
kanR. 

EXAMPLE 33 

15 SINGLE NUCLEOTIDE DISCRIMINATION IN A YAC TARGET 

In this Example, we test the ability to discriminate single nucleotide 
polymorphisms in human genomic DNA inserts within yeast artificial chromosomes (YACs). 

We use two yeast strains (AB1380), each containing a 248 kb YAC. The 

2 o YACs contain human beta-globin genomic DNA, differing in the p globin allele: one has the 

wild-type p globin allele (B*), the other the sickle cell mutation (6 s ) (Peterson et. a/., Proc. 
Natl. Acad. Sci. USA (1993)). 

We extract genomic DNA, including the YAC DNA, separately from each 
strain. We then form double D-loops in parallel reactions using essentially the same 
25 conditions as described in Example 17. We use either BGIoAWT/5'BioTEG/31 or 

BGIoAWT/5'Cy3/20Me as the incoming oligonucleotide, and either BGIoAWT/5'Cy3/20Me, 
BGIoAWT/20Me, BGIoAWT/L, or BGIoAWT/P, as the annealing oligo. Following double D- 
loop formation and deproteinization, we run the products on a 1% agarose gel and quantify 
the amount of double D-ioop formed using Molecular Dynamics ImageQuant™ and a 

3 o Typhoon™ imager. 
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Incoming Oligonucleotides : 

BGloAWT/5'BioTEG/31 
5 (5 ' - / 5BioTEG/ ACGGCAGACTTCTCCTCAGGAGTCAGGTGCA- 3 ' ) 

[SEQ ID NO: 110] 

BGloAWT/5'Cy5/31 

( 5 ' - /5Cy5 / ACGGCAGACTTCTCCTCAGGAGTCAGGTGCA- 3 ' ) 
10 [SEQ ID NO: 111] 

Annealing Oligonucleotides : 

BGloAWT/ 5 ' Cy 3 / 2 OMe 
15 (5'-/5Cy3 / mAmCmUmCmCmUmGmAmGm - 3 ' ) 

[SEQ ID NO: 112] 

BGloAWT/ 2 OMe 

(5 ' - / 5Cy3 /mAmCmUmCmCmUmGmAmGmGmAmGmAmAmG- 3 ' ) 
2 0 [SEQ ID NO: 113] 

The above two annealing oligonucleotides include 2'-OMe residues, 
indicated by the prefix "m°. 



25 BGloAWT/L 

(5' -+A+C+T+C+C+T+G+A+G+G+A+G+A+A+G-3' ) 
[SEQ ID NO: 106] 

This annealing oligonucleotide includes LNA residues, indicated by the 

30 prefix V. 



BGloAWT/P 

(Ac - ACTCCTGAGGAGAAG) 
[SEQ ID NO: 104] 

This annealing oligonucleotide is entirely composed of PNA residues; an 
acetylated terminus is indicated. 

Results 



40 



We find that we can discriminably detect a single nucleotide polymorphism 
in a clinically relevant human gene within the context of a complex eukaryotic genome (2.8 x 
10 7 bp per diploid genome). 
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EXAMPLE 34 
CONCURRENT FLUORESCENCE DETECTION 



10 



Labeling incoming oligonucleotide with one fluorophore and annealing 
oligonucleotide with another, we demonstrate that we can separately detect the presence of 
each discriminably labeled oligonucleotide present within a double D-!oop (red and green) 
and that we can concurrently detect the presence of both oligonucleotides within a double 
D-loop (yellow). Results are shown in FIG. 30. Not shown, fluoraphores separately present 
on the incoming and annealing oligonucleotide are capable of participating in fluorescence 
resonance energy transfer. 



Target 

15 Linear pAUR 123 hyg (rep) 

Incoming oligonucleotide 



20 



25 



30 



35 



40 



31 -mer with cy5 label 

"hyg" (sequence provided in other examples) 
31 -mer hyg with no label (sequence as above) 
Annealing oligonucleotide 

15 mer hyg all 2 ' -OMe methyl with Cy 3 label 

5, -/5Cy3/mGmUmGmGniAmUmAmGmGmUmCmCmUmGmC - 3M 
[SEQ ID NO: 114] ~ ~ 



15 mer hyg all 2 ' -OMe methyl with Cy 3 label 
Conditions : 

1. 2 jil lOx buffer 

2. l.ll jil incoming 
3 . 2 (xl recA 

(approximately 70 uM, as previously 
described in other examples) 
4. 12,89 jil dH20 
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5. Incubate 10 min @ 50°C 

6. 1 jil linearized pAur 123 hyg (REP) 
(where 1.88 \il is 500 ng) 

7. Incubate 10 min @ 50°C 
5 8. 1 \il annealing 

9. Incubate 10 min @ 50°C 

10. Freeze, add 2 \xl 10% SDS, 8 jal 25% Ficoll 

11. Run 1% agarose gel 3.5 hr 

10 

FIG. 30 Lanes ' 

Lane 1: no annealing, labeled cy5 31 mer 
incoming 

Lane 2: labeled cy5 31 mer incoming, unlabeled 

2'omethyl 15 mer annealing 
Lane 3: labeled cy5 31 mer incoming, labeled cy3 

2 , omethyl 15 mer annealing 
Lane 4: unlabeled cy5 31 mer incoming, labeled 

cy3 2 1 omethyl 15 mer annealing 
Lane 5: no incoming, labeled cy 3 2 1 omethyl 15 
mer 

25 

EXAMPLE 35 

SINGLE NUCLEOTIDE POLYMORPHISM DETECTION IN HUMAN GENOMIC DNA 



Using genomic DNA prepared from cultured cells as target, we 
3 o demonstrate that we can discriminate a single base pair change in a Hyg target integrated 
into human genomic DNA. 



15 



20 



35 



All patents, patent publications, and other published references mentioned 
herein are hereby incorporated by reference in their entireties as if each had been 
individually and specifically incorporated by reference herein. 
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Although illustrative embodiments of the present invention are described 
above, it will be evident to one skilled in the art that various changes and modifications may 
be made without departing from the spirit and scope of the invention. Many examples of 
such modifications have been given through the foregoing specification. It is intended that 
the appended claims cover all such changes and modifications that fall within the true spirit 
and scope of the invention. 
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WHAT IS CLAIMED IS: 



1 . A method of distinguishing the presence of a nonsupercoiled target 
nucleic acid from the presence of nonsupercoiled target variants within a sample of nucleic 
acids, the variants differing from the target by as few as one nucleotide within a common 
target query region, the method comprising: 

using a recombinase to mediate formation of at least one deproteinization- 
stable double D loop in the query region of the target under conditions that favor formation 
at the target query region over formation at variants that differ from the target by as few as 
one nucleotide therefrom; and then 

distinguishing the degree of formation of double D loops that are stable to 
deproteinization, a greater degree of formation distinguishing the presence of target from 
that of variants. 



2. The method of claim 1 , wherein said formation conditions 

comprise: 

contacting the sample with a first oligonucleotide and a second 

oligonucleotide, 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of the target 
across the entirety of the target query region, said second oligonucleotide includes 
a complementarity region that is perfectly complementary in sequence to a second 
strand of the target across at least a portion of the target query region, and at least 
one of said first or second oligonucleotide complementarity regions is imperfectly 
complementary to the respective first or second strands of the query region of each 
of said target variants, 

wherein said first oligonucleotide is bound by a recombinase and 
said second oligonucleotide comprises base modifications and does not 
substantially bind said recombinase, 
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and wherein at least one of said first and second oligonucleotides 
is distinguishably detectable. 



3. The method of claim 2, further comprising the antecedent step of 
binding a recombinase to said first oligonucleotide. 



4. The method of claim 3, wherein said recombinase is £ cot RecA 
or a mutein thereof. 



5. The method of claim 2, wherein said first oligonucleotide 
complementarity region is no more than 100 nucleotides in length. 

6. The method of claim 5, wherein said first oligonucleotide 
complementarity region is no more than 50 nucleotides in length. 

7. The method of claim 2, wherein said first oligonucleotide is no 
more than 100 nucleotides in length. 



8. The method of claim 7, wherein said first oligonucleotide is no 
more than 50 nucleotides in length. 



9. The method of claim 2, wherein said second oligonucleotide 
comprises base modifications selected from the group consisting of: LNA bases, PNA 
bases, RNA bases, and 2'-OMe bases. 



1 0. The method of claim 9, wherein said second oligonucleotide 
includes at least 30% modified bases. 
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1 1 . The method of claim 10, wherein said second oligonucleotide 
includes at least 50% modified bases. 

1 2. The method of claim 1 1 , wherein said second oligonucleotide 
includes at least 75% modified bases. 

13. The method of claim 2, wherein said second oligonucleotide 
complementarity region is no more than 50 nucleotides in length. 

14. The method of claim 1 3, wherein said second oligonucleotide 
complementarity region is no more than 25 nt in length. 

15. The method of claim 1 4, wherein said second oligonucleotide 
complementarity region is no more than 16 nucleotides in length. 

16. The method of claim 2, wherein said second oligonucleotide is no 
more than 50 nt in length. 

1 7. The method of claim 1 6 f wherein said second oligonucleotide is no 
more than 25 ntin length. 

1 8. The method of claim 1 7, wherein said oligonucleotide is no more 
than 20 nt in length. 



19. The method of claim 2, wherein said first and second 
oligonucleotide complementarity regions overlap by no more than 25 nt. 
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20. The method of claim 1 9, wherein said first and second 
oligonucleotide complementarity regions overlap by no more than 15 nt. 

21 . The method of claim 2, wherein at least one of the first and second 
oligonucleotides includes at least one detectabe label. 

22. The method of claim 21 , wherein said at least one label is selected 
from the group consisting of: a radionuclide, a fluorophore, a fluorescence resonance 
energy transfer tandem fluorophore, a fluorescence resonance energy transfer donor, a 
fluorescence resonance energy transfer acceptor, a mass tag, an enzyme, a genotypic 
label, or a hapten. 

23. The method of claim 22 r wherein said at least one label is a 

fluorophore. 



24. The method of claim 22, wherein said at least one label is a 

genotypic label. 



25. The method of claim 2, wherein said contacting is performed at a 
temperature of at least 37°C. 



26. The method of claim 25, wherein said contacting is performed at a 
temperature of at least 45°C. 



27. The method of claim 26, wherein said contacting is performed at a 
temperature of at least 50°C. 
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28. The method of claim 27, wherein said contacting is at a 
temperature of at least 55°C. 

29. The method of claim 1 , further comprising the step, after forming 
and before distinguishing the degree of formation of said double D loops, of deproteinizing 
the nucleic acids of said sample. 

30. The method of claim 29, wherein the deproteinization step is 
performed at a temperature of at least 37°C. 

31 . The method of claim 30, wherein said deproteinization step is 
performed for no more than about 10 mins. 

32. The method of claim 29, wherein said double D loops are stable for 
a time following deproteinization sufficient to permit detectable separation of said target 
from said variants. 

33. The method of claim 32, wherein said separation is electrophoretic 

separation. 

34. The method of claim 29, wherein said double D loops are stable for 
at least 2 hours at 4°C following deproteinization. 

35 jhe method of claim 34, wherein said double D loops are stable for 
at least 4 hours at 4°C following deproteinization, 
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36. The method of claim 29, wherein said double D loops are stable for 
at least 30 minutes at 37°C following deproteinization. 



37. The method of claim 1 , wherein said nonsupercoiled double- 
stranded target is linear duplex DNA. 



38. The method of claim 1 , wherein said nonsupercoiled double- 
stranded target is a covalently closed circle. 



39. The method of claim 1 , wherein said nonsupercoiled double- 
stranded target is an artificial chromosome. 



40. The method of claim 39, wherein the target query region within 
said artificial chromosome is flanked by recognition sites for a site-specific recombinase. 

41 . The method of claim 1 , wherein the nucleic acids of said sample 
are pooled from a plurality of individuals. 



42. The method of claim 1 , wherein the nucleic acids of said sample 
are from a single individual. 



43. The method of claim 1 , wherein said nucleic acid sample includes 
at least one variant that differs from said target by no more than one nucleotide in said 
query region. 



44. The method of claim 41 , wherein said nucleic acid sample includes 
at least one variant that differs from said target by no more than one nucleotide in the target 
query region. 
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45. The method of claim 44, wherein said at least one variant is a 
naturally-occurring variant of said target. 

46. The method of claim 43, wherein said at least one variant is a 
recombinantly-engineered variant of said target. 

47. The method of claim 25, further comprising the step, after 
deproteinizing and before distinguishing said double 0 loops, of separating the nucleic acids 
that have double D loops from nucleic acids lacking double D loops, 

48. The method of claim 47, wherein at least one of said first and 
second oligonucleotides includes a capture moiety. 

49. The method of claim 48, wherein said capture moiety is biotin. 

50. The method of claim 2, further comprising the step, after double D 
loop formation, of extending by polymerase either or both of the first or second 
oligonucleotides. 

51 . The method of claim 50, wherein said extension is a single base 

extension. 

52. The method of claim 50, wherein said extension amplifies at least a 
portion of the target region. 

53. The method of claim 52, wherein said amplification is isothermal. 
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54. The method of claim 1 , further comprising the step of quantifying 
the absolute or relative abundance of target. 

55. A method of distinguishably detecting the presence of a plurality of 
nonsupercoiled targets within a sample of nucleic acids with selectivity sufficient to 
distinguish each of the plurality of targets from variants that respectively differ by as few as 
one nucleotide therefrom at a query region that is common therebetween, the method 
comprising: 

using a recombinase to mediate formation, separately for each of the 
plurality of targets desired to be detected, of at least one deproteinization-stable double D 
loop in the target's query region, under conditions that favor double D loop formation at the 
target query region over formation at variants that differ from the target by as few as one 
nucleotide therefrom, each target's double D-loop being distinguishably detectable from all 
others of the double D-loops formed in said sample; and then 

distinguishably detecting each of the stable double-D loops so formed. 

56. The method of claim 55, wherein said conditions comprise: 
contacting said sample, for each of the plurality of targets desired to be 

detected, with: 

a first oligonucleotide and a second oligonucleotide, 

wherein said first oligonucleotide includes a 
complementarity region that is perfectly complementary in sequence to a 
first strand of its respective target across the entirety of the target query 
region, said second oligonucleotide includes a complementarity region that 
is perfectly complementary in sequence to a second strand of the same 
target across at least a portion of the target query region, and either or both 
of said oligonucleotide regions is imperfectly complementary in sequence 
to respective first and second strands of the query region of each of the 
other targets desired discriminably to be detected, 
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wherein said first oligonucleotide is bound by a 
recombinase and said second oligonucleotide comprises base 
modifications and does not substantially bind said recombinase, and 

wherein at least one of said oligonucleotides is 
distinguishable from the first and second oligonucleotides used to detect 
each of the others of the plurality of targets desired to be detected. 

57. The method of claim 55, further comprising quantifying the relative 
abundance of each of said targets. 

58. The method of claim 55, wherein at least 10 targets are 
discriminably detected. 

59. The method of claim 58, wherein at least 50 targets are 
discriminably detected. 

60. The method of claim 59, wherein at least 100 targets are 
discriminably detected. 

61 . The method of claim 55, wherein said plurality of targets are 
discriminably detected concurrently. 

62. The method of claim 61 , wherein said targets are detected by 
microarray hybridization. 



63. A method of separating a nonsupercoiled double-stranded nucleic 
acid target from other nonsupercoiled nucleic acids present within a sample of nucleic acids, 
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with selectivity sufficient to separate said target from variants that differ from said target by 
as few as one nucleotide within a common target query region, the method comprising: 

using a recombinase to mediate formation of at least one deproteinization- 
stable double D loop in the query region of the target, under conditions that favor double D 
loop formation at the target query region over formation at variants that differ from the target 
by as few as one nucleotide thereof; and then 

separating nucleic acids having deproteinization-stable double D loops 
from other nucleic acids present within said sample. 



64. The method of claim 63, wherein said formation conditions 

comprise: 

contacting the sample with a first oligonucleotide and a second 

oligonucleotide, 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of the target 
across the entirety of the target query region, said second oligonucleotide includes 
a complementarity region that is perfectly complementary in sequence to a second 
strand of the target across at least a portion of the target query region, and at least 
one of said first or second oligonucleotide complementarity regions is imperfectly 
complementary to the respective first or second strands of the query region of each 
of said target variants, and 

wherein said first oligonucleotide is bound by a recombinase and 
said second oligonucleotide comprises base modifications and does not 
substantially bind said recombinase. 



65. The method of claim 64, wherein at least one of the first and 
second oligonucleotides includes a capture moiety, and said nucleic acids having 
deproteinization-stable double D loops are separated from other nucleic acids present within 
said sample by capture of said moiety. 
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66. The method of claim 65, wherein said moiety is captured to a solid 

substrate. 

67. The method of claim 66, wherein said solid substrate is a surface 

of a bead. 

68. The method of claim 63, wherein separating said target from the 
nucleic acids in said sample effects at least a 10-fold purification. 

69. The method of claim 68, wherein separating said target effects at 
least a 100-fold purification. 

70. The method of claim 69, wherein separating said target effects at 
least a 1000-fold purification. 

71 . The method of claim 70, wherein separating said target effects at 
least a 10,000-fold purification. 

72. A method of distinguishing the presence of a supercoiled target 
nucleic acid from the presence of supercoiled target variants within a sample of nucleic 
acids, the variants differing from the target by as few as one nucleotide within a common 
target query region, the method comprising: 

using a recombinase to mediate formation of at least one deproteinization- 
stable single- or double- D loop in the query region of the target under conditions that favor 
formation at the target query region over formation at variants that differ from the target by 
as few as one nucleotide therefrom; and then 
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distinguishing the degree of formation of single- or double-D loops that are 
stable to deproteinization, a greater degree of formation distinguishing the presence of 
target from that of variants. 



73, The method of claim 72, wherein said formation conditions 

comprise: 

contacting the sample with a first oligonucleotide, 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of the target 
across the entirety of the target query region, said region being imperfectly 
complementary to a first strand of the query region of each of said target variants, 

wherein said first oligonucleotide is bound by a recombinase. 

74. The method of 73, further comprising: 

contacting said sample with a second oligonucleotide, 

wherein said second oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a second strand of the target 
across at least a portion of the target query region, 

wherein at least one of said first and second oligonucleotide 
complementarity regions is imperfectly complementary to respective strand of the 
query region of each of said target variants, 

wherein said second oligonucleotide comprises base modifications 
and does not substantially bind said recombinase, 

and wherein said second oligonucleotide is distinguishably 

detectable. 



75. The method of claim 74, further comprising: 
contacting said sample with a third oligonucleotide, 
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wherein said third oligonucleotide comprises base modifications 
and does not substantially bind said recombinase, 

wherein said third oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to at least a portion of the 
second strand of the query region of a target variant as to which the target is 
desired to be discriminated, and 

wherein said third oligonucleotide complementarity region is 
imperfectly complementary in sequence to the complementarity region of said first 
oligonucleotide. 

76. A method of distinguishably detecting the presence of a plurality of 
supercoiled targets within a sample of nucleic acids, with selectivity sufficient to distinguish 
each of the plurality of targets from variants that respectively differ by as few as one 
nucleotide therefrom at a query region that is common therebetween, the method 
comprising: 

using a recombinase to mediate formation, separately for each of the 
plurality of targets desired to be detected, of at least one deproteinization-stable single- or 
double-D loop in the target's query region, under conditions that favor single- or double-D 
loop formation at the target query region over formation at variants that differ from the target 
by as few as one nucleotide therefrom, each target's single- or double D-loop being 
distinguishably detectable from all others of the D-loops formed in said sample; and then 

distinguishably detecting each of the stable double-D loops so formed. 

77. The method of claim 76, wherein said conditions comprise: 

contacting said sample, for each of the plurality of targets desired to be 
detected, with a first oligonucleotide, 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of its respective 
target across the entirety of the target query region and imperfectly complementary 



WO 03/027640 



141 



PCT/US02/31073 



in sequence to a first strand of the query region of each of the other targets desired 
discriminably to be detected, 

wherein said first oligonucleotide is bound by a recombinase and is 
distinguishable from the first oligonucleotide used to detect each of the others of the 
plurality of targets desired to be detected. 

78. The method of claim 77, further comprising: 

contacting said sample, for each of the plurality of targets desired to be 
detected, with a second oligonucleotide, 

wherein said second oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a second strand of the target 
across at least a portion of the target query region, 

wherein at least one of said first and second oligonucleotide 
complementarity regions is imperfectly complementary to the respective strand of 
the query region of each of said target variants, 

wherein said second oligonucleotide comprises base modifications 
and does not substantially bind said recombinase, 

and wherein said second oligonucleotide is distinguishably 

detectable. 

79. A nucleic acid composition characterized by the presence of at 
least one deproteinized double D loop at a query region within a nucleic acid target, 
wherein: 

the deproteinized double D-loop includes a first and a second 

oligonucleotide, 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of the target 
across the entirety of the target query region, 
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wherein said second oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a second strand of the target 
across at least a portion of the target query region, 

wherein said first oligonucleotide is bound by a recombinase, 

wherein said second oligonucleotide comprises base modifications 
and does not substantially bind said recombinase, 

and wherein at least one of said first and second oligonucleotides 
is distinguishably detectable. 

80. The composition of claim 79, further comprising at least one variant 
that differs from said target by as few as one nucleotide within a query region that is 
common therebetween, wherein the query region of each of said at least one target variants 
lacks a double D loop. 

81 . A nucleic acid composition characterized by the presence of a 
plurality of deproteinized double D-loops, each of said plurality formed at a query region 
within a respective nucleic acid target, wherein: 

each double D-loop includes a first and a second oligonucleotide, 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of the target 
across the entirety of the target query region, 

wherein said second oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a second strand of the target 
across at least a portion of the target query region, 

wherein said first oligonucleotide is bound by a recombinase, 

and wherein said second oligonucleotide comprises base 
modifications and does not substantially bind said recombinase. 
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82. The composition of claim 81 , wherein at least two of said plurality 
of targets differ in sequence by 1 - 10 nucleotides as between their respective query 
regions. 



83. The composition of claim 82, wherein at least two of said plurality 
of targets differ in sequence by 1 - 5 nucleotides as between their respective query regions. 

84. The composition of claim 83, wherein at least two of said plurality 
of targets differ by exactly 1 nucleotide as between their respective query regions. 



85. The composition of claim 82, wherein said plurality includes at 

least 10 targets. 

86. The composition of claim 85, wherein said plurality includes at 
least 100 targets. 



87. The composition of claim 86, wherein said plurality includes at 
least 1000 targets. 



88. The composition of claim 87, wherein said plurality includes at 
least 10,000 targets. 

89. The composition of any one of claims 81 - 88, wherein said 
plurality of targets are nonsupercoiled. 

90. The composition of any one of claims 81 - 88, wherein said 
plurality of targets are supercoiled. 
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91 . A kit for distinguishing the presence of a target nucleic acid from 
the presence of target variants within a sample of nucleic acids, the variants differing from 
the target by as few as one nucleotide within a common target query region, the kit 
comprising: 

a first oligonucleotide; 

a second oligonucleotide; and 

a RecA-like recombinase; 

wherein said first oligonucleotide includes a complementarity 
region that is perfectly complementary in sequence to a first strand of the target 
across the entirety of the target query region, said second oligonucleotide includes 
a complementarity region that is perfectly complementary in sequence to a second 
strand of the target across at least a portion of the target query region, and at least 
one of said first or second oligonucleotide complementarity regions is imperfectly 
complementary to the respective first or second strands of the query region of each 
of said target variants, 

wherein said first oligonucleotide is capable of being bound by a 
recombinase and said second oligonucleotide comprises base modifications and is 
incapable of substantially binding said recombinase, 

and wherein at least one of said first and second oligonucleotides 
is distinguishably detectable. 

92. The kit of claim 91 , wherein said first oligonucleotide and RecA are 
combined in a single composition. 

93. The kit of claim 91 , wherein at least one of said first and second 
oligonucleotides is detectably labeled. 
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94. The kit of claim 91 , wherein said first oligonucleotide is no more 
than 100 nucleotides in length and said second oligonucleotide is no more than 50 nt in 
length. 



95. The kit of claim 94, wherein said first oligonucleotide 
complementarity region is no more than 50 nucleotides in length and said second 
oligonucleotide complementarity region is no more than 25 nucleotides in length. 



96. The kit of claim 91 , wherein said second oligonucleotide comprises 
base modifications selected from the group consisting of: LNA bases, PNA bases, RNA 
bases, and 2'-OMe bases. 



97. The kit of claim 96, wherein said second oligonucleotide comprises 

LNA bases. 



98. A kit for separately distinguishing the presence of a plurality of 
targets within a nucleic acid sample, with selectivity sufficient to distinguish each of the 
plurality of targets from variants that respectively differ by as few as one nucleotide 
therefrom at a query region that is common therebetween, the kit comprising: 

a RecA-like recombinase; and 

for each target desired to be separately to be distinguished, a pair of 
oligonucleotides, 

wherein the first oligonucleotide of the pair 
includes a complementarity region that is perfectly complementary 
in sequence to a first strand of its respective target across the 
entirety of the target query region, the second oligonucleotide of 
each pair includes a complementarity region that is perfectly 
complementary in sequence to a second strand of the same target 
across at least a portion of the target query region, and either or 
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both of the oligonucleotide regions is imperfectly complementary in 
sequence to respective first and second strands of the query 
region of each of the other targets desired discriminably to be 
detected, 

wherein the first oligonucleotide is capable of 
binding a recA-like recombinase and the second oligonucleotide 
comprises base modifications and is incapable of substantially 
■ binding a RecA-like recombinase, and 

wherein at least one of said oligonucleotides is 
distinguishable from the first and second oligonucleotides used to 
detect each of the others of the plurality of targets desired to be 
detected. 

99. The kit of claim 98, wherein each of said first oligonucleotides is 
combined with RecA. 



1 00. The kit of claim 99, wherein each of said first oligonucleotides is 
combined in a separate composition. 

101. The kit of claim 99, wherein each of said first oligonucleotides is 
combined with RecA in a single, common, composition. 



1 02. The kit of claim 99, wherein at least one of the first and second 
oligonucleotides of each target pair is detectably labeled, the label of each target pair being 
distinguishable from that of each other target pair. 



103. 



The kit of claim 102, wherein each said label is a fluorescent label. 
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104. The kit of claim 102, wherein each said label is a genotypic label. 



105. The kit of claim 98, wherein each of said first oligonucleotides is no 
more than 100 nucleotides in length and each of said second oligonucleotides is no more 
than 50 nt in length. 



106. The kit of claim 105, wherein the complementarity region of each 
said first oligonucleotide is no more than 50 nucleotides in length and the complementarity 
region of each said second oligonucleotide is no more than 25 nucleotides in length. 



107. The kit of claim 106, wherein each said second oligonucleotide 
comprises base modifications selected from the group consisting of: LNA bases, PNA 
bases, RNA bases, and 2'-OMe bases. 



108. The kit of claim 107, wherein each said second oligonucleotide 
comprises LNA bases. 



WO 03/027640 



PCT/US02/31073 



1/31 



INCOMING 0LIG0 



ADDRecA+ATfy-S 



RecA FILAMENT 

WEB 

| ADD TARGET dsDNA 
RecA STABILIZED D-LOOP 

r:^^MbiiiniiM 



J ADD ANNEALING OLIGO 
RecA STABILIZED DOUBLE D-LOOP 




PROTEIN DENATURATION 



COMPLEMENT STABILIZED DOUBLE D-LOOP 



fflllflf 



FIG.1 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



PCT/US02/31073 



2/31 



1 2 3 4 5 6 7 8 9 10 



ft * 




Hp 



AW 



4| 




FIG. 2 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



3/31 



PCT/US02/31073 




SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



PCT/US02/31073 



4/31 



1 2 3 4 5 6 7 8 9 10 




FIG. 4 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



5/31 



PCT7US02/31073 



4° 15° 25° 37° 45° 

* IV, ffif ' 




FIG. 5 



SUBSTITUTE SHEET (RULE 26) 



03/027640 



6/31 



PCT/US02/31073 



4° 15° 25° 37° 45° 50° 





SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 PCT7US02/31073 

7/31 



1 2 3.5 5 7.5 10 20 30 45 60 

- W ■»* *»•■»• <" «* 




FIG. 7 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



PCT/US02/31073 



8/31 




SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



PCT7US02/31073 



9/31 



2 4 5 6 7 8 Annealing Oligo 
V W X Y Z 1 Incoming Oligo 




1 2 3 4 5 6 



FIG. 9 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



PCT/US02/31073 



10/31 



4 4 4 4 4 Annealing Oligo 




1 2 3 4 5 



FIG. 10 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



11/31 



PCT/US02/31073 



none HKILMSPNO Annealing Oligo 




1 2 3456789 10 
FIG. 11 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



12/31 



PCT/US02/31073 



none E F G J Q R noRecA Annealing Oligo 




1 2 3 4 5 6 7 8 



FIG. 12 



SUBSTITUTE SHEET (RULE 26) 



WO 03/027640 



PCT7US02/31073 



13/31 



Oligonucleotide Sequence of the KarTTarget 

1 CAGGGGATCA AGATCTGATC AAGAGACAGG ATGAGGATCG TTTCGCATGA 
51 TTGAACAAGA TGGATTGCAC GCAGGTTCTC CGGCCGCTTG GGTGGAGAGG 
101 CTATTCGGCT ATGACTGGGC ACAACAGACA ATCGGCTGCT CTGATGCCGC 
151 CGTGTTCCGG CTGTCAGCGC AGGGGCGCCC GGTTCTTTTT GTCAAGACCG 
201 ACCTGTCCGG TGCCCTGAAT GAACTGCAGG ACGAGGCAGC GCGGCTATCG 
251 TGGCTGGCCA CGACGGGCGT TCCTTGCGCA GCTGTGCTCG ACGTTGTCAC 
301 TGAAGC 

FIG. 13 
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Oligonucleotide Sequence of the Hyg"Target 

1 cgctgagata ggtgcctcac tgattaagca ttggtaactg tcagaccaag 
51 tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa 
101 aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 
151 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 
201 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca 
251 aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc 
301 aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata 
351 ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta 
401 gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 
451 cagtggcgat aagtcgtgtc ttaccggg 

FIG. 15 
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